m l^nra■RNAT^o^^4L application published under the patent coopebation treaty (pct) 



(1?) World Intdtectudi Property Orgdttizstion 
Internationa! Bereaw 



liiliiiiilililiiiiiiiiiillliiiii 



(43) Infernatfooat Fuftikatba Date 
22 August 2002 (22.08.2002) 



PCX 



<10) ]Bt«m8tto»at Pttt^Hcallon Nuntljer 

WO 02/064843 Al 



(Si) Internatioaat Patent Ctessifieattoa": 
C07D24l/36,CI2N 35/00 



(2n IntecnatifKial Application N»i»ber: PCr/US{,»2/03703 

(12) tatef-riatioaal F«i»g Date; 8 y*i)njarj' 2062 (08.012002) 

(25) F«K»gL8»gtiage; BsgHsh 

(2^) Publicstien Latt£a8g«: BngiM 

(38) Priority Data; 

mnmm 9Febr«ar>'20{)} (09,02.200n US 

60/34932 1 15 imimy 2002 (1 5,t>l .2002') US 

{7J) Apt>tiea»t (far aJi d&xmiaied StaSes except VSJi THE 

I REGEN1S OF THE U NIVERSITY OF CALIFORNIA 

: fliS/US]; lUl Frafiklin Street, 12th "Plcor, Oakland, CA 

s 94.607 -5200 (US). 

5 (71) Applicants »«S 

I (72) iHventBrs: HAYGOOft, Margtt [US*US]; 2390 King 

! Anhm Coart, U JolJa, CA 92037 (US). SALOMON, 

: Christine fUS/USj; 9nOF Regents Road, La loJJa, CA 

I 92037 (US;. FAlfLKNER, Mta, 0, {IfSAJSJ; 5915 

I Desert View Driw, La JoUs. CA 9203 ? (US). 



C12R t/00, (81) ftesigtwtwl States (nauomi}; AE, AG. AL. AjM, AX, Al), 
AZ, BA, BB, BO. BR, BY, BZ, CA. CH, CN, CO, CR, CU. 
CZ, DE, DK, DM, DZ. EC. ES., BS, Pt, GB, GD. GE, GH. 
GM. IIR, W;, SIX IL. JK, LI, -TP, KH. KG, K?. KK, KZ, E.C. 
LK, LR, LS. LT, LU, LV. MA NTD, MG, MK, MN, ^5V, 
MX. MZ. .NO, .\'Z, m, ?!... PT. RtO. RTJ, Slh SE. SO, 
SI. SK, &U T.T, TM, TN, TR, TT. TZ, UA, UG, US, XSZ, 
VN, YU, ZA, ZM, ZW. 



{U) Desigtiated States .rregn.<.'!c(.i: ARIPO paten! (GH, GM. 
KE. LS, MW, MZ, Si>, SL, SZ, TZ. UO, ZM. ZWj. 
euTBSisnpatsfsi ^AM, AZ, BY. KG, KZ.MO, Kt^ D. T?vi), 
Butopesan paten: (AT, BE, CH, CY. DE, 1>K, BS. il. m, 
GE, OR. IE, IL LU, MC. PT', S£, TR). OAPl patent 
(EF, m, CR CG, a, CM, GA. GN, GQ, GW, Ml.., ?vfR. 
NH.SHT»,TG). 

I>tib!Uh«l: 

— wJft internatioml search repon 

before tfte expsnstian of the time limii fos- amendi>ig (te 

ciaims mxi to be r^puhUsfmd in (k<> svunl <;/' r&ceipl of 
(!m£ncimef7is 



For twiy-kilsr codes andoiher abbrin'iatiom, refer (o ihe "Gstd- 
anceNettes m Codes andAbbreviatioits" cppeaHng adhs begin- 
King of each reguieir issm of the PCTGaxem. 



5 (74) Ageutt VIKa^lNS, Ann, S.; Scbwegtnan, LatiUbeig. 
: Woessuer & Klaiii, P.O. Bo.x 293S, Mir.«eapohs, Mia- 
s ne»ta55*2(US). 



I (54) rttfe; SCTBWASCIDIN FAMB..YCOMJ'OX)NDS: COMPOSmONS .AND MBmODS 




► |57> Abstract: The js*senS 



provides it bactarial sytnbjonl of EcteiftasciiUa turbincUa, and meaiods for the synifaasis .->f sn 



wo 



PCm!S(!2/e37«3 



ICTEINASCroiN FAJVHLY COMPOUNDS: 



COMPOSITIONS Am ^'lETHODS 



Teciiateal Field 



5 



The present invention generally relates to the production of bioacdye 



compoimds isucli as eotemascidins and raefliio<is of makmg such eatemascidias 
Hsiag a variety of methods. In particuiar, bactedal symbionts of the asddian 
Ectmnascidia turhimta, its proteins or genes can be used m such methods. 



'Hie ectemascidiJis are a group of bioactive tetrahydroisoqumoline 
alkaloids isolated from the Caribbean maiigim'e ^scidim Ecteinascidia turbinata 
(Rittehart et ai. , 1990 arid Wright et aL, 1990). The most active of tiiese 
compounds, ectelnascidin 743 (Bt-743), (Corapouttd 1) shows remarkable 

1 5 |)otency in both in vitm aad m vivo bioassays, and possesses picomoiai activitj^ 
against hr^sast, colon, non-small cell lung carcinoma (NSCLC), renal, ovariaa, 
and mdaiToma ceU lines (Zewdi-Foote et al., 1999). The potent activity of Et- 
743 and itB clinical potmtial places Et-743 in a small group of marine 
invertebrate-derived compounds, inclnding diseodermoHde (Gunasekera et al,, 

20 1990), bryostatins (Pettit et al., 1982) and elentherobM (JLindeJ et al, 1997). 



10 



Background of the InveatioB 




1 ecteinascidia743 (Et 743) 



2 sa&amycjie A-C 

3 cyaaosaftaciaR,K3Nf,R2==H,R3<Md2CH3 
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6 jonanycin miieroi 8 miiaosamycm 

lit-743 binds to tlie imnox gi-oove of DIN A and speciiScally ailcyktes ftie 
K2 of guanine (Poimnier et ai., 1996 and Moore et al, 1997), The drug differs 
fi-om tbe other jiiinor gj-oove-interactive compounds because it induces a beiid In 
5 th& DNA lielix towarf the major giwve (Zewail-Foote et ai., 1999). Et-743 
shows ^ti-tumor activity witto IC55S m the naBomolsr range and is related to a 
large aamber of less potent compounds (Rinehart et al., 1990 and Fairclofli et al,, 
1996). Et-743 is ia phase I climcal trials in the United States and Europe and is 
in phase H clinical trials in France (Valcai et al, 1988 and XUnehart, 2001), 
10 Mechamsticaily, Et-743 disrupts the microtubule network and binds 

siAstantially irreversibly to 5 -GGG, 9~G<jC and S^-AGC DNA sequences that 
leads to cell cycle arrest in the late S and G2,'M phase (Garcia-Rocha et al, 1 996; 
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Moire et al., 1 997; Poramier et al, 1996; Morre et al., 1998; and Fairdoth et al., 
1996). 

Siracmrally, Et-743 k similar to tlie renieramydns (He et at, 1,989 and 
Frincke et al, 1982) from the mmm sponge Renfera sp: It shows remarkable 
5 structe-al siffiilarity to the less active satramycm antibiotics, (Compounds 2-3), 
respectively, isoistsd from the bacteria Sn-eptomyces lavendulae (Arai et al, 
19S3), Psmd&fnonas Jluorescem (Meyers et al., 1983) md Myxococats xamkus 
Qxsclak et al,> 1988), as well as the rexiietamycins (Compound 4) and aibrostatin 
4 (Compound 5) isolated from the marine sponges Jteniera sp, and 

10 Cribrochalim respectively (Frincke et ai, 1982 and Pettit et al,, 2000). 
Sa&amycin dmvates are produced in calture and synthetic rentes exist for 
derivatives of most of the isoqninolone afitibiotics, 

Joramycin (Compound 6) was isolated from the Indian midibranch 
Jarimna funebris that was foimd "associated with" a blue sponge, identified ^ 

15 Oceitncqna sp. (Fontana et al, 2000). The monomeric isoquitiolines 

(Compottn(fe 7 and S) were also found in the Oceant^ia sponge, as well as 
cttlt«re broths of 5. lavendulae, and a variety of othar sponges (Frincke and 
FanUaier, 19S2; McKee and Ireland, 1987; Pamneswiuran et al, 1998), It is 
believed that compotaids such as Compounds 7 and 8 are the result of oxidative 

20 cleavage of the dhneric compouncfe and probably arise from the isolation and 
extraction procedui-e (Princke and Faulknen 1982). Et-743 differs from 
compounds 2-6 primarily by die presence of the C ring sub-unit system, wliich is 
critical ibr its high potency and tumor selectivity (Sakai al., 1996). 

Et-743 is expensive, sance natural marine products are typically difficult 

25 to obtain in suSicient quantity for cHtiicai use. Altbon^ aqtiaculture cuirently 
provides a source for Bt-743, it yieldsi only a small aonount of the compound per 
mass of inverteJsrate (the average yield of the tog from the ascidian ate 
gm^ally low (0.01-0.02 % dry weight)). Until recently, the most efficient 
method of Et-743 prodnclion was aqnacultnre of turUnata (Corey et al, 

30 1996). Efforts have been made to produce Et-743 more economically by 

synthetic processes (Martinez and Corey, 2000; Zhou et al., 2000; Herberich et 
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al.j 2001). A rslativsly efficient chemioaJ sya&esis exists for tlie compound but 
is a less proinisiiig source tiiaii aquaculture (Corey et al., 1996). Cuevas et ai, 
reported efiBcimt semi-syntJiesis of Et-743 ftom cyaiiosafraciu B (Compound 3) 
wMdi can be e^My obtaiived in kilogram quaotities from culture of 
5 Psmdomonas fiioresceas (Cuevas et al, 2000). Aji attractive alternative to 
aqaacuitare or even seiBi-s>'nt!\esis may be the cloning of the biosynflselic genes 
into a more tractable host, such as P,JIuorescerts. Thus, there ia a need for 
alternative methods for producing ectdnascidiKs, such as Et-743. 

iO Brief Dfesg riptioa of tfa<^ Mgares 

HG. 1 dq>icts the strucfare of Ecteinascidin-743 {Et-743). 
FIG. 2 depicts the modification of L-Tyr, 
FIG. 3 depicts a scheme for the synthesis of £1-743. 
FIG. 4 depicts in hybridization of bacteriaj 16S rKNA probes to 
15 lan.'ae of turhinaia. 

FIG. 5 depicts adult £*, turbinata showing brooding of eggs in lower 
portion of the zooid, (15X) (a); excised larvae at various development stages 
(20X){b). 

HG, 6 depicts (a) ESI mass spectrum firom the £. iurbinata larval extract; 
20 (b) MS/MS of &e parent ion (744) showicig Joss of the C-ring subunit of Et-743. 

HG. 7 depicts DGGE gel of PGR products amplified with 1055f-1392r. 
1 ; j?. fyirbinMia^ 2. E. turbinata, 3. colL Arrows point to the two bauds that 
were excised and sequenced. 

FIG. 8 depicts phylogram obtained with maximum parsimony and lOOx 
25 bootstrapping (indicated by numbers). 

FIG. 9 depicts HSH sxperimmts using fluorescein labeled probes. 
A, Domain Bacteria probe Eub338; B. no probe control; C. K palauensis probe 
1247r; D. E. sertuia prabe 1253r. 

FIG. 10 dqpicts FISH microgr^hs of J?, turbinata larvae using all 
30 excitation wavelengths on a conlbcal microscope. A. Wljole lan'a hybridized 
with CY5 labeled Eub338 indicated by bine) lOX, scale bar - 100 (B~D) 
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bfiages of larvae taken near aiSassive papilla© B. 40X^ scale bar = 20 pm, C. 
63X, scale hm~lQ \m, D. 63X plus digital zoora^ scale bar ~ 5 \im, 

PIG. II depicts PGR amplific^on of £. mrbinata larval DNA using a 
combination of laiiyersai, Bndobugtda specific aad E. seruda specific primers. 1 
5 ~E. tuf'bimta larml BNA, 2~B. pacifica BNA used as a conijtil, 3 - negative 
control. A; imiversal bacteria! pnia6rs(27f- 14921), B: umversaiwith 
Endohugula specific (27f-8S5r), C; E. serttda specific {240f-1253r), D: 
universal wiVaE. sertula specific (27f-1253r). 

FIG. 12 depicts light micrograph sectiotis of E. turhinata iai-va through 
10 the adhesive p^iUae, A. 16X» scale bar = 70 |im B. 40X, scale bar « 30 |im. 

Methods provided h^ein inclade using bacteria that are iiivolved at least 

ill part in the syntliesis of bioacti ve compounds. These bactesria caii be used iii 
15 production processes, siicb as fenrienialion, to produce valuable bioactive 

compounds. Altematively, polypeptides present in such bacterial or nucleic acid 
molecuks that encode such pol^'peptides can be used, to produce bioactive 
compounds in ex vivo processes or in recombitiant organisms engiiieered to 
express such polj'peptides. Hie present invention provides these and related 
20 benefits. 

The present invention recognizes that organisms such as bacteria contain 
nucleic acid molecules that encode polypeptides that catalyze the synthesis of 
bioactive compotmds, such as ectsinascidins, 

25 One aspect of the present invention is a bacterium or bacteria tliat can be 

a ptare or mixed culture, that at least in part syathesijses at least one ecteinascidin. 

A second aspect of the present invention is a preparation of bacteria irom 
m Ecteinascidia, such as Ecteinascidia turUnata, that includes at least one 
activity involved in the synthesis of an ecteinascidin. 
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A third aspect of the present isventioa is a composition including at least 
one nucleic acid molecule that encodes at least one polypeptide that catalyzes at 
least one step in the syn&esis of at least one ecteinascidin. 

5 A fourth aspect of the pr^ent iBvenlion i s a composMoii iiiGtoding a 

Mbrari'^ of nucieic add i»oleciiies of the present invention, lliese nucleic acid 
molecules can be used in a combinatorial biosynthesis of ecteinascidins. 

A fifft aspect of tfee ptcsmt invention is a composition including at le^t 
10 one polyp^tide that catalyzes at least one step in the synthesis of at least one 
ecteinascidin, whea-e prefmbly at least one polypeptide is derived finom at least 
one marine csrganisni, 

A sixth aspect of tJie pres^at invention is a composition including a 
hTsrary of poi>pe|>tides of the presmt invention. 
15 Antibodies 

A seventh aspect of the present invention is an antibody that binds wi& 
or specifically binds wi&i one or inore bacteria of the fsresent invention. 

An ei^ith aspect of the presatit invention is an antibody that binds with 
or i^ecificaliy binds wifii a pol>peptide of Uie present invention, 
20 Mmhod ofmakins bioactive compounds 

A ntoth aspect of the present invention is a method of making an 
ecteinascidiii using a bacteria of the present invention, 

A tenth aspect of the present invention is a composition made using a 
25 bacteria of the present invention. 
^ucl^ic Acids 

An eleventh aspect of the presmt invention is a method of maldng a 
composition^ including providing at least one nucleic acid of the present 
invention and synthesizing at least one ecteinascidin. 
30 A t\velf!h aspect of the pres«it invention is a composition made using a 

nncieic acid of the present invention. 
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Polypep tides 

A thirteenth aspect of tJie present mvention is a method of making a 
compoatioa inctodiug providing at least one polypeptide of the present iaveutiott 
and S3?iitfaesi2jag at least one ectdnascidin. 
5 A foiirtesiith aspect of the present irnrention is a compOisi tion made using 

a polypeptide of the present mvention. 
Method for idmtijying nucleic acid moleeules 

A fiflee»l3x aspect of the present invention is a method for identifying at 
least one ntjcleic acid molecule encoding at least one activity iavolved in the 
10 synthesis of an ecteinascidin, comprising contacttng a nucleic acid molecule of 
the present invention with a sample, and idaitifying tmcleic acid molecules in 
said sample that hybridize mtii a nucleic acid of tlie present invention, 

A sixteenth aspect of the present invention is a nucleic acid molecule 
identified by a method of the present invejition. 
15 A seventeenth aspect of the present invention is a composition 

comprising a library of nucleic acid molecules identified using a method of the 
present invention. 

Method for identifying hioacfiv? t^^pmmds 
Bacteria 

20 An eightessath aspect of the present invention is a method for identifying 

a bioaciive componnd including contacting a compound made or identifisd using 
a bacteria of the present invention witli a detection system and determining tlie 
bioactivity of the compoimd, 

A nineteenth aspect of the presmt invention is a compound identified by 

25 this method. 

Atweatieth a^ect of the present inve3iti0ti is a method for id«sitifpig a 
bioactive componnd mcluding contacting a cotnpoand made or identified using a 
nucleic acid molecule of the present invention with a detection sj^tem and 
30 doteiminiag the bioactivity of the compomid. 
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A tweaity-first aspect of Que present inventioa is a compound idmtified by 
this method. 

Polypeptide 

A twenty-second aspec t of iiie pr^mt iiiveatioii is a mefiiod for 
5 identi^'iBg a bioaeti^^e ODmpoimd including contacfeig a compouxid made or 
identified using a pol>peptide of the present invention with & detection system 
and dctenioiinBg the bioactiyity of said coinpoiind, 

A twenty-third aspect of the present invention is a compound identified 
by this method. 

10 

fc^tgeg P^^iiiptiOB of tfte |yventi<)B 

Definitions 

TJrit^s defined otherwise, all technical and scientific terms used herein 
have tbe same meaning as commonly imderstood by one of ordinary'- skill in tbe 

15 art to wliicb tins nivention belongs. Generally, tiie nomenclatm'e used herein and 
the laboratory procedures in cell cutore, cheniistrj', microbiology', mokcular 
biology, cell science ceil cultave described below are well known, and 
conmionly employed in the ait Conv«Rtional methods are used for these 
procedures, such as those provided in the art and various general refareaces 

20 (Sambrook et al., 1989; Ansubel et al, 1998; Harlowe and Lane, 1988). mere a 
term is provided in the singular, the inventors also contemplate the plural of that 
term. The nomenclature used barein and the labomtoi^ procedures described 
below are tliose well kno-^Ti and cojmioniy employed in the ait. As emplo>'ed 
througfaout the disclosnre, ihe foiiots'ing terms, unless otherwise indicated, shall 

25 be understood to have &e foHo'Rdng meanings; 

'M^Bbrane permeant dadvative" refers to a chemical derivative of a 
compound &at increases membrane penneability of tiie compound. These 
dmvatives are made better able to cross cell membranes becatise hydrophilic 
groups are masked to provide more hydrophobic derivatives. Also, the maldng 

30 groups can be designed to be cleaved firom the compound within a ceil to make 
the compound more hydrophilic ones within the cell Because the substrate is 



more hydropMUc &aii the membrane permemit derivative, it preferentially 
localiK&s mthin the cell {U.S. Patent No. 5 ,741 ,657 to Tsien et ai., issued 
April 21, 1998). 

"Isolated pol>'nucleoticie" refers to a pol>TOcleotide of genomic, cDNA, 
5 PCR or svnthetic origin, or some combmatioK thereof, wMch by vhme of its 
origin, the isolated polynucleotide (1) is not associated with the cell iii which the 
isolated polyaucieotide is fotmd in Katare, or (2) is operahly linked to a 
polymicl^tide th^ it is not linked to in nature. The isolated polynucleotide can 
optionally be linked to promotea^, enhancers, or oiiher regulatory sequences. 

10 **fsolated protein" refers to a protein of cDNA, recombinant RNA, or 

synthetic orfgin, or some combination tiereof; which hy virtue of its origin the 
isolated protein (1) is not associated with proteins nonnaJly tbimd wifhia miixue, 
or (2) is isolated from the ceil in wMch it normally occms, ox (3) is isolated free 
of otha- proteins from the same cellular source, for example, free of cellular 

1 5 proteins), or (4) is expressed by a ceil &om a dififerent species, or (5) does not 
occur in nature. 

•Toiypeptide" is used herein as a generic term to refer to native protein, 
fragments, or analogs of a polypeptide sequeaaoe. 

"Active ftagment" refesrs to a fragment of aparent molecule, such as an 
20 organic molecule, nucleic acid molecule, or protein or polypeptide, or 

combinations thereof, that retains at least one activity of tlis parent inolecule, 

"Naturally occurring" refers to the fact that an object can be found in 
nature. For example, a polypeptide or pol>'iiucleotide se<}uence that is present in. 
m. orgaaism, including viruses, fliat can'be isolated Jfrom a source in nature and 
25 which has not been intentionally modified, by man. in the laboi^ory is nahiraliy 
occuning- 

"Opscafaly linked" refe^ to a juxtapoation wherein tlie eomponente so 
described are in a relationship permitting them to function in their intended 
manner. A control sequmce operably linked to a coding sequence is ligated in 
30 such a way that expression of the coding sequence is acMeved under conditions 
compatible with the control sequences. 
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"Cotttrol sequences" refer 1© polynucleotide seqiiences that effect the 
expression of coding and non-coding sequmces to which tlioy are ligated The 
nature of such control sequences differs d^^diag upon the host organism; in 
prokaryotes, such control seqaences geaierally include promoter, ribosomal 
5 biding site, and transcription termination sequences; in safcaryotes, generally, 
such control seqiietices include promoters and transcription teiimnation 
seqacnces. ITie term control sequences is intended to inclnde components whose 
presence cm inSuence expresdoa, and can also include additional components 
whose presence is advantageous, for example, leader sequences and fusion 
10 partaer sequenceSx 

**Poiyiiticleotide" refers to apolymeric fona of nucleotides of a least ten 
bases in length, edtliea: ribonucleotidfis or deoxyribonucleotides or a modified 
from of eith"^ type of nucleotide. The tean includes single and double stianded 
forms of DNA or RNA. 
15 "Crenomic polynucleotide" refers to a pordon of tlie gei^ome. 

"Active genomic polwucleotide" or active portion of a genome" refer to 
reports of a genome that can be regulated, do%ii regalated or both, either 
directly or indirectly, by a biological process, 

"Directly" in the context of a biological process or processes, refers to 
20 direct causation of a process that does not require intermediate steps, usually 
caused by one molecule contacting or binding to another molecule (the same 
type or different type of molecule). For example, molecule A contaols molecule 
B, which causes molecule B to exert effect X that is part of a biological process. 

"Indirectly" in the coiitext of a biological process or precesses, refers to 
25 indirect causation that requires intermediate steps, usually caused by two or more 
direct st^s. For example, molecule A contacts molecule B to exert effect X 
which in turn causes effect Y. 

"Sequence identity'* refers to the proportion of b^e matches between two 
nucleic acid sequences or tiie proportion of amino acid matches between two 
30 amino acid seguencss. When sequence identity is expressed as a percentage, for 
example 50%, the percentage denotes the proportion of matches of the length of 
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sequences from, a desired sequence that is compaired to some other ssqutmce. 
Gaps (in either of the two sequences) are pemritted to maxiinize matchmgj gap 
lengths of 15 bases or less are usually usedj 6 bases or less are preferred with 2 
bases or less more preferred. When using oiigonuleotides as probes, tiie 
5 sequence idsutit>' between the target nucleic acid a^d the oligonuclsotlde 
sequence is preferably not less than 10 Im-get base matches out of 20 (50% 
ideatit)') and more preferably not less than about 60% identity^ 70% identity, 
80% id^tity or 90% identic), and most preferably not less than 95% idestity. 
"Selectively hybridize'* refers to detectafoly and specifically bind, 

1 0 Pol]yxiucleotides, oligonucleotides and firagmrajts thereof selectively hybridize to 
target nucleic acid strand, under hj^sridization and wash conditions tliat 
rmn imiae appreciable amounts of detectable binding to nonspecific nucleic acids. 
High stringency conditions can be used to achieve seiective hybridization 
conditions as kno\-Ni3 m the art, Gerserally, the nucleic acid sequence identity 

1 5 between the polynucleotides, oligonucleotides, and iragments thereof and a 
nucleic acid sequence of interest will be at least 30%, and more typically and 
preferably of at least 40^i, 50%, 60%, 70%, 80% or 90%. 

ITj^ridization and washing conditions are typically performed at high 
stringency according to conventional hybridization procedures. Positive clones 

20 are isolated and sequenced. For example, a Ml length polynucleotide sequence 
can be la:beied and used as a hybridization probe to isolate genonsic clones frona 
an appropriate toget library as they are known in the art Tjpical h>'foridization 
conditions and metliods for screening plaque lifts and otlier pmposes are known 
in the ait (Benton and Davis, Science 196:180 (197S); Sanibroofc et al, supm, 

25 (1989)). 

Two ajnino acid sequences have share identity if there is a partial or 
complete identity between their sequences. For example^ 85% jdeaititymeans 
tiat 85% of the amino acids are identical when the two sequences are aligned for 
maxixQuna matching. Gaps (in either of the two sequences bdng matched) are 

30 allowed in maxiiidzing matching; gap lengthi^ of 5 or less are preferred with 2 or 
less being mom prefen-ed. Alternatively and pref^xably, two proteha sequences 
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(or polypspiide sequmces derived from fiiem of at least 30 animo aci<te in 
Isagtb) shsre identity, ^ this tmm is used herein, if tiiey have an alignment score 
of at least 5 (in standard deviation units) using tiie prograjn .ALIGN with, the 
mutation data matrix and a gap penalty- of 6 or greater (Dayboff, m Atlas of 
5 Protein Sequence and Structure. National Biomedical Research Foundation, 
volujtne 5, pp. 101-110 (1972) md Supplenient 2, pp. 1-10). 

"Corresponds to'* refans to a poljaiucieotids sequence that shares identity 
(for example is identical) to all or a portion of a referesice polynucleotide 
sequence, or that a polypeptide sequence is identical to all or a portion of a 

10 reference polypeptide sequence. In contradistinction, tlie term "coropiementary 
t»" is med herein to mean that the complementary sequence is homologous to all 
or a portion of a reftaresnce polynucleotide sequence. For iUastration, the 
nucleotide sequence TATAC con'esponds to a reference sequence TAT AC and is 
complementar>' to a i-eference sequence GTATA- 

1 5 The following terms are used to describe the sequence relationships 

betvi-een two or more pol>'nucleotid^; "refcfrence sequence," "comparison 
window," "sequence idesnti^' " "percentage of seqnaace identity " and 
"sttbstaiTtial identity." A reference- sequence is a defined sequence used as a 
basis for a sequence comparison; a reference sequence can be a subset of a larger 

20 sequence, for example, as a sepnent of a Ml length cDNA or gene sequence 
given in a sequence listing, or may comprise a complete cDNA or gene 
sequence. Generally, a reference sequence is at least 20 nucleotides in length, 
frequently at least 25 nucleotides in len^, and often at least 50 nucleotides in 
length. Since two polynucleotides can each {!) comprise a sequence (for 

25 example a portion of the complete polynticleotide sequence) Ifcat is similar 
between the two polynucleotides, and (2) may further comprise a sequence that 
is divestment between the two polynucleotides, sequence comparisons between 
two (or more) polynucleotides are typically performed by comparing sequences 
of the two pol>*nncleot5des ov^ a "comparison window" to identify and compare 

30 local regions of seqtimce similarity. A comparison window, as used herein, 
refers to a conceptual se^ent of at least 20 eontipious nucleotide positions 
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wherem a polyimcieotide sequence may be compared to a teference sequence of 
at least 20 contiguous nticleoddes and whereiia the portion of the poljuiicleotids 
sequence m tbe ccmipaxi&on wmdow can comprise additions and deletions (for 
example, gaps) of 20 percent or less as compared to the refsreace sequence 
5 (which woiM not conipri se addi ti ons or dsletiojss) for optimal aligimieM of &e 
two sequences. Optimal aiigiimmt of sequence for aiigumg a comparison 
window can be conducted by the local identity aigoritbxn (SmUh and Watertnan, 
Adv, Appl Math,, 2:482 (1981)), by the identity alignmeat algorithm 
(Neediranaa aad Wunscli, J. Mol Bio., 48:443 (1970)), by the search for 

10 similarity xneM (Pearson and Lipman, Pnoc. Natl Acid. Sci. U.S. A, §5:2444 
(198S)), by the computerized implemmtatibns of these algorithms such as GAP, 
BESTFir, PASTA and TFASTA (Wisconsin Genetics Software Page Release 
7.0, Genetics Computer Group, Madison, ^VI), or by inspection. Preferably, the 
best aiignmsnt (for example, the result having the higliest percentage of identity 

15 over the comparison window) generated by the various methods is selected. 

"Complete sequisice identity** means that t\vo polynucleotide sequences 
are ideatical (for example, on a nucleotide-by-sucieotide basis) over the window 
of comparison. 

"Percentage of sequence identity" is calculated fay comparing two 
20 optimally aligned sequences over the viindow of comparison, detrnnining the 
number of positions at which the identical nucleic acid bass occurs in both 
sequences to yield the number of matched positions, dividing the aomber of 
matched positions by the total number of positions in tlie window of comparison 
(for example, the window size), and multqjlying the result by 100 to yield the 
25 percentage of sequence identity. 

"Substantial identity" as used herein denotes a characteristic of a 
polynucleotide sequence, wherein iiie polynucleotide comprises a sequence that 
has at least 30 percent sequeance identity, preferably at least 50 to 60 percent 
sequence, more usualiy at least 60 pearcaat sequence identity' as compared to a 
30 reference setjuence over a comparison window of at least 20 nucleotide 

positions^ fequently over a window of at least 25 to 50 nucleotides, wherein the 
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percen^e of scqumce i<3eatity is calculated by comparing fhe reference 
sequence to the polynucleotide sequence that may include deletions or addition 
wfaidi total 20 percent or less of the reference sequence over fee wadow of 
comparison, 

5 "Subatantial idsnXity" as applied to polj-peptides herein meaas that two 

peptide sequences, whea optijEMily aligaedj sxkch. m by the programs GAP or 
BESTFIT using default gap wei^ts, shai'e at le^t 30 percent sequence identity^ 
preferably at least 40 percent sequence identity, and more preferably at least 50 
percent sequence identity, and most pj'^ferably at lest 60 percent sequence 
10 identity, Prefejsbly, residue positions, which are not identic^, differ by 
conservative amino acid substitutions. 

"Consta-vative amino acid substitutions" refer to the iT5terch;angeabiiity of 
residues having similar side chains. For example, a gi^^up of amino adds having 
aliphatic side chains is glycine, alanirie, valine, leucine, and isoleucine; a group 
1 5 of anano acids having aliphatic-hydroxyl side chains is serine and thi eonine; a 
group of amino acids having amide-containing side chains is aspara^e and 
gltttamine; a ^oup of amino adds having aromatic side chains is phenyialanine, 
tyrc^ine and tryptophan; a group of amino acids having basic side chains is 
lysine, arginine and histidine; md a group of amino acids having sulfur- 
20 containing side cban is cystein and methionine. Preferred consen'ative amino 
acid substitution gamps are: vaHne-leucine-isokiicine; phenylalaim^e-tjxosine; 
fysine-arginiue; alaiutie- valine; glutamic-aspartic; and asparagine-glutamine. 

"Modulation" refers to tlie capacity to either sjihance or inhibit a 
fimctional property of a biolo^cal activity or process, for example, enzyme 
25 activity or receptor hindmg. Such enhancement or inhibition may be contingaat 
on the occurrence of a specific event, such as activation of a signal transduction 
pathway and/or may be naamfest only in particular cell types. 

"Modulafior'* refers to a chemical (naturally occurring or non-naturally 
occurring), such, as a biological macromolecule (for example, nucleic acid, 
. 30 protein, non-peptids or organic molecule) or an extract made from biological 
materials, such as prokKyotes, bacteria, eukaryotes, plaiits> fimgi, multicellular 
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orgmiisius or animals, mvertebrates, ^'ertebrat^, mammals and himiaiis, 
including, where ^propriate, eodracts of: whole orgaiiisms or portions of 
oigani^s, ceils, organs, tissues, fluids, whole cultures or portions of cultees, or 
enwonmaatel samples or poitioiis thereof. Modulators are typically evaluated 
5 for poteatial activity as ijiliibitors or activators (directly or indirectly) of a 
biological process or processes (for example, agonist, partial antagonist, partial 
agonist, antagomsij aiitirteopiastic, cytotoxic, inhibitors of neoplastic 
transfoimatioa or ceil proliferation, ceilprolifeation promoting agents, aativirai 
agents, antimicrobial agoits, antibacterial agents, antibiotics, and the like) by 

1 0 inclusioa in ^says described herein. Tho activity of a modulator may be Icnown, 
unknown or partiaily known. 

'Test chemicar* refers to a chemical or extract to be tested, by at least one 
method of the present invention to be a putative rRodulator. A test chemical is 
nsually not known to bind to the target of iriterost. "Control test cbemicar' refm 

15 to a chemical known to bind to the tinget (for example, a known agonist, 

aatagonist, partial agonist or inverse agonist). Test chemical does not typically 
include a ehemicai added to a mixture as a control condition that alters the 
fimction of the target to diJtermine signal specificity in an assay. Such control 
chemicals or conditions include chemicals tiiat (!) non-specificaUy or 

20 substantially disrupt protdn structure (for example denaturing agents such as 
urea or guandium, sulfhydryl reagents such as ditbiotritol and beta- 
mercaptoethanol), (2) generally inhibit cell .metabolism (for example 
mitochondrial imcoupies) and (3) non-specifically disrupt electrostatic or 
hydrophobic interactions of a protein (for example, high salt concentrations or 

25 detergents at conccnfcratiojas sufficient to non-specifically disrupt hydrophobic or 
electrostatic interactions), Tbe term test chemical also does not typically include 
chOTiicais known to be unsuitable for a ther^eufic use for a particular indication 
due to toxicity of the subject Usually, various predetermined concentrations of 
test chemicals are used for determining their activity. If the molecular wel^t of 

30 a test chemical is known, the following ranges of concentrations can be used; 
between about 0,001 micromolar and about iO miOimolar, preferably bet^'esn 
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about 0.01 niicromolar aad about 1 imllmioiarj more preferably between about 
04 raiicroHiolar aiid about 100 micromolar. When extracts are lises a test 
cbemicals, tihe couoeatration of test cbemical used cao be expressed on a weight 
to vohime basis. Under fliese circumstances^ &e foliowhig ranges of 
5 concmtratioiis caa be used: between about 0.001 niicrogmms/njl and abo«t 1,00 
milli gram/'ml, preferably betweea about 0.0 1 micro grams/mi and sbout 1 0 
miliigraiBs.c'mi, and more preferably bettii^een about 0.1. microgiBm&'ml and about 
1 niilligraitt&f^Bl or between about 1 imcrogram/ml asid about 100 
micrograms/ml. 

10 "Targef ' refers to a biocheisieal entity involved in a biological process. 

Targets are typically proteins that play a useful role in the physiology at biology 
of an organism, A therapeutic chemical typically binds to a target to alter or 
modulate its fonction. As used herein, targets can inchide, but not be limited to, 
cell sm^fece receptors, G-proteiBs, G-proteis,i coupled receptors, kinases, 

15 phosphatases, ion chaimels, lipases, phosholipases, nuclear receplDts, 
intracellular siructisres, tnbnles, tubulin, and the like. 

"Label" or "labeled" refers to incorporation of a detectable marker, for 
example by incorporation of a radiolabled compound or attachmaa-t to a 
polypeptide of moieti'es such as biotiin that caa be detected by the binding of a 

20 section moiety, such as maiked avidin. Various methods of labeling 
polypeptide, nucleic acids, caibohydrates, and other biological or organic 
molecules are known in the art. Such labels can have a variety of readouts, such 
as radioactivity, i3uorescence, a>lor, chemilmmmescence or other readouts 
known in the art or later developed. The readouts can be based on enEvmaiic 

25 activit)'', such as beta-galactosidase, beta-lactamase, horseradish peroxidase, 
alkaline phosphatase, litciterase; radioisotopes such as -H, "C, ^^S, '^1 or 
fluorescent proteins, such as greaa fluorescent proteins; or oth^ fluorescent 
labels, such as FITC, ihodamine, and laalhanides. Where ^ropriate, these 
labels can be the product of the expression of reporter genes, as that term is 

30 understood in the art. Examples of reporter genes ai-e beta-lactamase (U.S. 

Patent No. 5,741,657 to Tsien et al., issued April 21, 1998) and green fluorescent 
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protein (U.S. Patent Ko. 5,777,079 to Tsien et al, issued My 7, 1998; U.S. 
Patent Ho. 5,804,387 to Cormack et al., issued September 8, 1 998). 

"SubstantiaUy pivre" refers to &x\ object species or activity that is tiie 
predominant species or acti\ity preseat (for example on a molar basis it is more 
5 abimdmtt thaii any other individual species or activities in the composition) arsd 
preferably a substantially purified fraction is a coi»position wiiereia the object 
species or activity comprises at least about 50 percent (on a molar, weight or 
actiYxty basis) of all mactotnolecules or activities present Generally , as 
sebstantially pure composition will comprise more than about SO percent of all 

10 macromolecular species or activities present in a composition, more preferably 
more tlian about 85%, 90%, 95% and 99%. Most preferably, the object species 
or activity is purified to essential boniogeneity, whsreiu contaraiTiant species or 
activities carmot be detected by coaventional detection raethods) w'bsrein the 
composition consists esse-ntially of a single njacromoiecukr species or activity. 

1 5 Tiie invei^tors recogoize tliat an activity may be caused, directly or indirectly, by 
a single species or a plurality of specii^ within a composition, particularly with 
CEXtracts. 

A "ecteinasoiditt synthesis activity' is at least one activity involved in the 
synthesis of an eoteinascidin. Examples of ecteinascidin s>'n&esis acti\'ity can 
20 include without limitation a non-ribosomal peptide sjiitlietase activity, an O- 
methyi trantsfea^ase activity, or a trsiisaminase zctirity. 

'Tharmaceutical agent or drug" refers to a chemical, composition or 
activity capable of inducing a desired therapeutic effect when propeity 
administored by mi ^propriate dose, regime, route of administration, time and 
25 delivsy modality. 

•*Phannaceutical agent or drug" refers to a chemical, composition or 
activity capable of indmang a desired jiJierapeutic effect when property 
administered by an appropriate dose, regime, route of administration, time and 
delivery modalit>'. 

30 A '%ioactive compound'* refers to a compound, that exhibits at jea.5t one 

bioactivity. A bioactive compound may be an organic molecule or inorganic 
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molecule, protein, lipid, carboiiydrate, or nucleic- add, or any combiaatioii 
{Jiereof, including complexes of any of tbese con^omids. 

A "bioacti vity ' refers to at least one acti\ity that modulates a biolopcal 
process. Preferred bioactivitiss iBciude. but ;5re aot limited to: aiiiitiimor 
5 activity, mticancer cell activity, cytotoxic activity, antibacterial activit>'^, 
antimiciobiai activity', not being substaiitiaily sasceptible to molti-drug 
resistance, antiviral aotivityj iiiHaunoraodulatory activity, anti-infiammatory 
activity, and radiation protective activity. 

"An ecteinascidin" refers to a compound that includes a 
10 tetrahydroisoquinoline alkaloid derived fi-om the marine tunicate Ecteitiascidia 
turhinata and has at least one bioactivity. 

"Made at least in part" or "at least in part s>'ntbesized" means that an one 
or more aatities participate in one or more st^s in the syndesis of the compound 
of interest or precursor of a compound of interest but may sot be the sole mtity 
15 or entities that participate in the synthesis of the compound or precunsor of 
interest. Wbm a single st^ in the synthesis of a compound of interest or 
precursor of a con^und of int^-est is performed by one or more entities, "made 
at least in paat" can refer to the participation of one or more entities in a sj-nthetic 
steps or process that a!so requires tbe participation of at least one other emit}', 
20 A *'bioactive derivative" refeis to a modification of a bioactive compound 

or bioactivity that retains at least one characteristic activity of the parent 
compound. 

A "bioactive precursor" refers to a precursor of a bioactive compoimd or 
bioacti vity that exhibits at least one characteristic acti vity of the resulting 
25 bioactive compound or bioactivity. 

An, "antibacterial activity" refers to an activity that reduces the growth 
rate or numbers of living in a sample, such as a culture of bacteria or a sample 
that includes at le^t one bacteria, including a patient. Such antibacteriai activity- 
can be directed against Gram-negative or Gram-positive bacteria and can be 
3 0 screened for or confirmed using methods known in fee ait 
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All "antimicrobiL;; dctiviiv*' refers to an activity that reduces the gi-owth 
rate or nun -bers of li\ i.ng pjicrobes in a sample (including prokaiyotic and-'or 
eukarwtic microbes), such as a culture ox niicrobes or a sample &at includes at 
least o»c microbe, iuchvdiug a patient, and caii be screened for or confimaed 
5 using raetbodslcnown in flie art 

Aa ''antiyiial activity" refers to an activity aiat reduces 
viros partides in a sample* s»oh as in a sample including at least one vims, 
including a patient Sucia anti viral activity cais bs directed against, from example, 
DNA or KM A containing virases, includijig, but aiot limited to, herpes\'irus, 
1 0 hepatitis vims and retrovirus. Such activity can be screened for using methods 
Icnown ill the art. 

Ml "aiititumor actisdty" refers to an aotivit>' that reduces the growth rate 
or number of tamor cells in a sample, such b& a culture of tumor cells or a sample 
that includes at least one tumor cell, including a patient Such antitumor activity 
15 can be directed against any type of tumor or tumor cell* including, but not limited 
to r^al tumor, lua^ tumor, colon tumor, central nervous system tumor, 
melanoma, ovarian tumor and breast tumor. 

All "anticancer cell acti%it>'" ref^ to an activity that reduces the gtmih 
rate or number of cancer cells in a sample, such as a culixire of Cvancer cells or a 
20 sample that includes at least one cancer cell, including a patient. Such anticancer 
cell activity can be directed against any type of cancer cell, i ncluding, but not 
limited to renal cancer, leukania, lung cancer, colon caiioet, c^trai ner\'ous 
system cancer, melanoma, ovarian cancer and breast cancer. 

A ''cytotoxic activity" refers to an activity fiiat reduces the number of 
25 viable cells in a sample, mcluding prokaiyotic cells, eukaiyotic cells or both. 
Such aotivit)'^ cati be screwed using methods known in the 9Xt. 

A "patient" or "subject" refers a whole organism in need of treatment, 
such as a farm animal, companion animal orhnmaa An animal refers to any 
non-human animal. 

30 O&er technical terms used herein have their ordinaiy meaaiitg m the art 

^ tiiey are used, as exeinplifted by a variety of technical dictionaries, such as 
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flie McGraw-HiU Dictionaiy of Chemical Irnim aad the Stedmau' s Medical 
Dictionarvf. 

The biological and zoological nmctions of the ecteinasckims are 
unknovi''n. Some of tlie roles generally ascribed to secondary metabolites incliide 
5 deteiTence agamst predation, iiiinbition of the settlement of foiiliBg organisms, 
protection firom bacterial or visal mfecfions, a»d prevention of encroachment by 
adjacent organisiHS, 

Localization studies of compounds in sponges asoidians have 
depended upon the presence of unique chemical stnictural featiures, such as 

10 heavy eieinents or pH depsndeM fluorescence, high bioactivity, m&or the ability 
to separate morphologically distintct bacterial and eukatyotic cells (tiaompson et 
al, 19S3; Bewley et al, 1996; Salomon et al., 2001; andUnson et al., 1994). 
However, many natural products do not have distinctive stmciiiral 
characteristics, and may not have high activity in bioassays. In addition, some 

15 invertebrates are not amens^ie to tiissue dissociation and cell enrichment, and 
many bacteria do not have anique morphologies. Invertebrates containing 
interesting natural products are particularly difficult to work with becai^e msuiy 
of them cannot be cultured e^ily in a laboratory aad oft«i must be collected and 
preserved in remote, tropical areas. Additionally, sponge and ascidian systems 

20 are'coraplicated by the feet that the adults filter seaxvater for food and generally 
contain large quantities of dietary-derived bacteria within their mesophyl matrix 
and gut, respectively. In order to stndy the potential role of s^mbionts in the 
biosynthesis of specific compounds in such complicated systems, it is necessary 
to employ a combination of chemical, biological and genetic teclmiques. 

25 One method of studying symbionts in typically complex 

invertebrate/miciobiai associations is to tocxis on the chemistry and bactmal 
populations of non-feeding larvae. Many invertdjrate larvae swim for a stmt 
period of time (minutes to dajfs) and often do not filter feed until they 
pemianenfly settle onto a surface ai^d metamorphose into an adult (Stratenann, 

30 1985). Mmiy invertebrates associated with obligate s>'nibionts are kumm to 
transmit bacteria vertically to their ofFspring; thereforej any bacteria that are 
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comistently aad speGifically associated with the lawae ai-s presiimed to be &"ue 
symbionts (Kiiieger et aL, 1996; Beiiayalm and ScHeyer, 1998; Hirose, 2000). 
This approacli was used to implicate a bacterial s^mbiotit as a source of the 
potential anti-cancer agents br/ostatins irom the biyoziian Btigula neritina. The 
5 hryozoau larvae were foand to contam both the bryostatins as well as a 
moBOCulture of s>mbio1ac proteobacteria identified as Endobugula seitda 
cmididatm (Haygood and Davidson, 1997; and Davidson aodHaygood, 1999). 

The present invcaition recognizes Uiat organisms such bacteria contatn 
nuoleic acM molecules that encode polypeptides that catalyze the synthesis of 
10 bioaotive confounds, such as ecieinascidins. 

As a non-iimiting iiitroductioTi to the breath of the preseat invention, the 
preseaat irtveaition includes several gtmeral and useful aspects, including: ■ 



1) a bacteria that at least in part synttoesizes at least one 
ecteinsscidin; 

2) a preparation of bacteria fix>m an Ecteinascidia, such as 
Ecteinascidia turbUmta^ that indudes at least one activity 
involved in &e s>^thesis of an ecteinascidin; 

3) a composition including at least one nucleic acid molecule that 
encodes at least one polypeptide that catalyses at least one step in 
the synthesis of at least one ecteinasoidin; 

4) a composition including a li braiy of nucleic acid molecules of the 
present invmtion, where one or more of these aucieic acid 
molecules can be used iti a combinatoria! biosynthesis of 



a composition including at least one polypq>tide that catalyzes at 
least one st*^ in the synthesis of at least one ecteinasoidin, where 
preferably at least one polypeptide is derived Som at least one 
marine organism- 

a composition including a library of polypeptides of the present 
invention; 
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7) an antibody &at binds widj or ^ecifically binds with a bacteria of 
the p-esent invention; 

8) an antibody tiat binds with or speciiicaliy binds with a 
polypeptide of ttis present invention; 

9) a me&od of making an ecteinascidin using a bacteria of the 
present invention; 

10) a composition made using a bacteria of the present inventioBj 

1 1) a metiiod of makirig a composition, including pro viding at least 
one nt«jleic add of the present invention and synthesizing at least 



12) a composition made using a nucleic acid of the present invention; 
B) a method of mafciiig a compositjon including providing at least 

one polypeptide of the present invention and syntliesizing at least 

one ecteinascidin; 

1 4) a composition made using a polj-pepti de of tlie present invention; 

15) a method for identifying at least one nucleic acid molectUe 
encoding at least one activity of involved in the sjti thesis of an 
ecteinascidin, comprising contacting a nucleic acid molecule of 
the present invention with a sample, and ideatitifying nucleic acid 
molecules in said sample thai hybridize with a nucleic acid of the 
presmt Invention; 

16) a nucleic acid molecule identified by a method of the present 
inx'-eniion; 

17) a composition coniprising a libray of mioleic acid molecules 
identified using a method of the present invention; 

18) a method for identifying a bioactive compoimd including 
contacting a compoimd made or idesntified using a bacteria of the 
present invention and determining tlie bioactivity of the 
compound; 

19) a compound identified by this method; 
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20) amethodforideatiiymgabioactivecon3|)0iffidiijcluding 
contactiag a compoimd made or identified usiiig a nacldc acid 
molecuie of the present invention and detenuining the bioactivity 
of the compound; 

21) a compound identified by this metfeod; 

22) a method for idaitijptig a bioactive compound including 
contacting a confound made or idaatified nsing a polypeptide of 
the preset iaveation aiid detenaining the bioactivity of said 
compound; a»d 

23) a compound identified by tbis method. 



These aspects of the inveniioB, as well as oibsrs de^smbed herem, can be 
achieved by using tlis methods, artides of manufacture and annpositions of 
matter described herein. To gain a fiiB appreciation of the scope of the present 
1 5 invmtion, it will be fiirther recogmzed tliat vadons aspects of the present 
invention can be combined to make desirable embodiments of the invention^ 



I. Bacteria That Synthesize An Ecteiaascidiii At Least In Part 

The bacteria maybe a member of any bacterial species, such as species 
20 belonging to the eubactersa or aicbiiebacteTia, that s^Tithesize an ecteinascidiB, at 
least in part, fhreferabiy, the bacteria are marine bacteria, such as s>'mbionts of 
the tunicate Ecteimscidxa turbinata. The bactma are isolated from tiieir natural 
growth environment sacb that they are cultured in the absence of llieir host 
organism. 

25 Isolation, pnnfication, and cailturing of bacteria can be accomplished 

using standard microbiological techniques as they are known in the art. Sterile 
technique can be employed to prevent the introduction of exogenous bacteria of 
&ngi into the preparation. The bacteria of interest may be isolated from E. 
tm'hinata by disrupting or homogenizing a preparvation of turbinata. 

30 Disruption or homogemzation maybe done by crusliing, grinding, slicingj or 
nipturittg the E. turbinata. This may be done manually, with or wtliout 
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implemmts mch as homogenizers, razor blades, scalpels, mortar and pestlej and 
iiie like, or may be done wi& any of a variety of naacMnes such as, but not- 
limited to» a biender or a motorized homogenizer. 

The disruptate or homogenate maybe filtered to remove large particles or 
5 debris from ths preparation, in tliis case the filter lias a pore size large enough to 
allow bacteria to pass tlirougli. CenirifugatioTji may be used to sediment tlie 
bacteria. Preferably this is done after removing large paitides horn die disruptats 
or homogenate. Aiter cmtrifugation, ths supernatant is removed and the pellet 
containing the bacteria may be resuspended in a bu.Qbr ot media. 

10 The bnflfer or media to be used in resnspending the bacteria can be any 

media, but preferably is a media that pixjinotes the growth of Uie bacteria. Hie 
media optionally contains a caibon source, a nitrogea source, salts, extracts mch. 
as ye-ast, beef, or liver extract, md/ox vitmnins. It may be desirable to include 
extracts of the E. twbimta host orgsaiisin in the media. Such extracts may be 

1 5 talcen from the disruptate or homogenate supernatant, or may be fractions of an 
E. mrbinata disruptate or homogenate obtained by one or more centrihigations, 
column chromatogKiphy steps, precipitations, dialjses, or other fractionation or 
pnriiication st^s. Altrniatively, samples of ^. turbinata can be processed by 
sterilization, such as by autoclaving, and fee resulting preparation used in 

20 microbiological medium by the optioi^ inclusion of a gelling agent, such as 
pectin, agar or gelatin. Other nutrients can be added to such medium, such as 
vitamins, miiierals and carbon and/or nitrogen sources. Optionally, an ocean 
environment can be created by adding autoclaved or filter sea- water, or artificial 
sea water (such as Instant Ocean) to such media. Substances that inhibit the 

25 growth of contatninants can be included in &e mediimi. Such substances can be 
k»om antibiotics, or can be other substances fotmd to inhibit the growth of 
contaminant, and can be ecteinascidins. 

CiUtures may be obtained by plating the bacteda on solid or s^i-solid 
media or by propagating in liquid media and incubating the bacteria at a 

30 tenaperalnrs, pressure md atmospheric conditions that allows the bacteria to 
piopa^te. Preferably, such ctiltare conditions mkror or are similar to those that 
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the imcroorganian esncouaters in its m^ve enviroameni Pure cultores may be 
obtained by a variety of art recognized methods, siich as sequentially streaking 
colonies of bacteria fiomihe cijltore, and/or by sequential dilution of bacterial 
cultores. Cultures enriched for a particular microorganism can be obtained by 
5 ciiltiuiiij a sample m a sslcc tiVs or Cufcr&ntial miciobioio Jtt^ai nicibum tudi 
ti-idt the i:^Ltti\e pioportjoti apartjculai mitrooigaruisnj la mcied-sed re]du\ e to 
other miciooigatjisms Such ennchment procedures can be used restorati\ ely to 
provide for enriched cultures of varying purity. 

The bact^a may be cultured on solid or semi-solid mediaj bat once 

10 purifiedj, ths bacteria are prefarably cultured is liquid media. Wh&a. cultured in 
liquid media, the cultores may be shaken for aeration, or air may be injected into 
the culture through a tabe to introduce oxygeaa into the culture, if such bacteria 
are facijltatively a^bic or aerobic in metabolism. Preferably, the media t^ed to 
cuitiire such microorgaoisras mirrors or is related to the natural envirannieivt of 

1 5 ths microorganism. 

Preferably &e bacteria are grown to optimal density for the production of 
ecteinascidins or ecteinascidin precursors, and the bacterial culture is harvested 
and centrifiigedto sediment the bacteria. If the culhire media is to be used for 
extraction of one or more ecteinascidins or ecteinascidin precursors, the 

20 supernatant is removed &om the bacterial pellet. Purification of one or more 
ecteinascidins or ecteinascidin precursors may be perfonned tln-ougk any 
methods kno-wn or developed in the future in the art, including combinations of 
any of the following; dialysis, precipitarions, solvent partition, and 
clnomfsiograpliy, including high speed countercurrsnt chromatography 

25 (HSCCC), nonnal, and iwerse phase chromatography. Other methods for 
purifying organic molecules are known in the art and may be employed as 
apptopri^e. 

Activities of ectein^idins may be a^yed by any of the following art- 
recognized methods: An antitumor activity may be assayed 1?? vivo using 
30 xenografts of human tumors in nude mice (see, for example, Veloti et al., 1.977- 
1983 and U.S. Patmt Numbers 5,256,663 and 5,089,273, herem incoiporated by 
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refeence) or ear vivo (Hakala, et al,, 1987). An and-cancer cell activity may be 
assayed using cell lines such as the P-388 murine leukemia cell Hue, the HT-2.9 
human colon cancer cell line, dxid the A-549 human hxng adeiiocai-ciBoraa cell 
line (see, for example, U.S. Patent 5,478,932^ and 5,089,273 hereby iDcorporated 
5 by reference and references disclosed tfaet'eia.) A. cytotoxic activity can be 
assayed by using the L12I0 muriae letJicemia cells or the CV-1 monkey kidney 
cen line as disclosed in U.S. Patents 5,478,932 aad 5,256,663, both hereby 
incorporated by reference. Assays to determine antibacterial and antiviral 
activities are also well known in the art and may be done as disclosed in U.S. 
1 0 Pataat numbers 5,089,273 and 5,484,717, and references disclosed therein. 

li. Nucleic Acid Molecaleis That Encode Polypeptides That Catalyze the 
Synthesis of An Ecteinasddin At Least la Part and Libraries of Such 
Nucleic Acid Molecules 

1 5 The present invfaitioa hicludes a composition incinding at least one 

nucleic acid molecule, such as a substantially purified or purified nucleic acid 
molecule, that encodes at least a portion of at least one polx-peptide that catalyzes 
at least one ut&p ki the synthcvsis of at least one ecteinascidiii. Preferably, at least 
one nucleic acid molecule is derived from at least one marine orgaiHsm. The 

20 nucleic acid molecules of the present invention can comprise the nucleic acid 
molecules disclosed herein, including PCB. primers, portions thereof, and nucleic 
acid molecules tliat selectively hybridize with or have substantial identity with 
the nucleic acid molecules disclosed herein or portions tha-eof, or encode at least 
one conservative amino acid substitution relative to tiie disclosed sequmces or 

25 portions thereof. A nucleic acid molecule of the present invention caa be DNA 
JRNA, single strmded, double stranded or any combination thereof 

A nucleic acid molecule of the present inveutiou preferaijly esjcodes at 
least aportion of a pol>'peptide involved in the sjiithesis of at least one 
ecteinascidin. Preferably, the poiypep tide is at least a portion of a protein or 

30 complex that has non-ribosomal peptide transferase activity, diat has 0-methyl 
-transferase activity, that has transaminase activity or that participates in the 
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synthesis or mcKiification ofou? a\ 'note et,'en'!asC'Cin& \ nucleic acid molecule 
of tiie present invention prv ^ r dK od^5 at 1s.kt ^ portion of at least one s«ch 
active do.iwm iUiJ ' . li.' n 'ci o e ^' \'t> of •^uJiau :iaue doniai'-i 
prefsiabl) an acuMi^ ]i t ^ i i it l^^it o tf* -^U^p m ihe sjniiiesi" of an 
5 ecleind'-cidm Pietc-abl}, a nucl ic ^cid riolecule ot the ptes:;nl in\entif>n is 
between about 10 bases or base pairs sad about 500 Kb, between about 100 
bases or base pairs and about 250 Kb ot between about 500 bases or base pairs 
and about 1 00 Kb in leaagth. 

A nucleic acid molecule of the present mvention can be deiived &om at 

10 least one marme organism. A marine organism caa include any organism tliat 
can be toimd m a marine enviromjieni, either iiatiirally or xenotypicaliy. A 
ittarine organism can be a vertebrate, an invertebrate or a raaicelluiar organism, 
such as a fungi, algae or bacteria. Preferably, a marine organism is an 
iir^fertebrate, such as m. Ectemmcidia^ such as Ecteinasddia ttirbimta^ or a 

15 tmioellular organism, such as a bacteria, such as a bacteria associated with an 
Ectmmscidia, such as Ecteinasddia turbimta. 

A nucleic acid of the present invention can also encode a fusion proteirt 
that includes a polypeptide of the present invention and a pol>'peptide of interest. 
A pol)peptide of interest can be asy -polypeplide, but is preferably a detectable 

20 label, such as green fluorescent protein, or a sequence that aids in the purification 
of apolypeptide> such as FLAG, A nucleic acid that encodes a fusion protein 
can be made by operably linMng a nucleic acid that encodes a polypeptide of 
interest with a i»ucleic acid that encodes a polypeptide of the present invention. 
The operable linking c^ be direct c$ indirect, such as in the case where a linker 

25 connects the polypeptide of the present invention with a polypeptide of inttsrest. 
The nucleic acid molecule of the ipresent invention and the nucleic acid that 
mcodes a polypeptide of interest are preferably operably linked in frame such 
that an operable polypeptide of the present mvention and an operable 
polypeptide of interest aie translated feom tlie nucleic acid, but that need not be 

30 the c^. 
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Nuoieic acid molecules of the present iuvention can be made usmg 
metliods laiown m the art and. described herein (sm, Brnmbmok et al, supra 
(1989); Aiisubel et al, sqira (1998)). For example, nucleic acid molecules of 
the present mvontion can be identified and isolated using PCR methodolo^es, 
5 includmg RT-PCR, aad sequenced using established metbods such tiiat their ■ 
homologies can be determined. The ability of one nucleic add molecule to 
hybridize witb another can be detenmnjed through experimentation under a 
variety of stringencies, or can be estimated based on their length mid G:C 
contents. Alterations of identified sequences can be made using routine 

10 methods, such as mutagenesis, RT-PC31 or oth^ PCR methods (See, Sambrook 
et al., 1989; AttSttbei et al, 1998). 

A nucleic acid molecule of the present invention can include at least one 
expression control sequence. Prefeobly, an e;!q>ression control sequence is 
operably lix:kcd to a nucleic acid molecule such that the nucleic acid molecule 

1 5 can be expressed in an in vivo or in vitro transcription and/or translation system. 
The choice of expression control sequences is dependent upon the transcription 
syst^n. to be used. For example, if a prokai^'otic organism such as E. coli is to 
be used to express a nucleic acid molecule, then at least one appropriate 
prokar>'Otic expression control sequence would be used. Likewise, if a 

20 eukaryotic organism is to be used to express a nucleic acid molecule, then at 
least one appropriate eukaryotic expression control sequence, such as tlie CMV 
pronaoter or LTRs would be used. Such nucleic acid molecules can be in any 
form, such as in a plasmid or in a linear form. 

A nucleic acid jnolecule of tibe present inveaition can be provided witii or 

25 without expression control sequences in a vector, such as a plasmid or a vii'al 
vector. Viral vectors can be chosen so that they are appropriate for a cefi to be 
traiisfected, such as, for example, a phage, cosmid, retrovirus, vaccinia virus, 
adenovirus or adeiio-associated vijus. Viral vectors can introduce a nucleic acid 
molecule mto a cell during its nonnal biologicai processes. Non-viral vectors 

30 can be i^ed to introduce a nucleic add molecule of the present invention into a 
host cell using methods known in the artj such as lipofection, cold calcium 
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chlonde treataent or electroporation. The nucleic acid molecule in a cell can be 
extrachromosomal or be integrated mtc ifee genome of tlie cell. The host ceil can 
be any appropriate host cell, such as a eafcaryotic or prokaiyotic cell Preferably, 
a nucleic acid molecule of the present inv^tion is esscpressed in the cell, but tliat 
5 is not a requirement of the invention. Preferably,, the cell does not Tiotinally 
include a nucleic acid molecule of ite present invention or express a polj'pqjtide 
of the present invention, but that need not be the case. For exatQpIe, a cell thai 
expr^es a reiatively low amount of a polypeptide of fee present invention can 
be made to express relatively hi^er aroounts of a polypeptide once transfected 

10 mth a nucleic acid oftfae present invention. 

Cells that express a polypeptide of tbe present invention can be screened 
for and selected using a variety of methods, including those set foitii in the 
present invaiiliott. For example, inununoassays, such as Vv'estem blots, can he 
used to identify cell lysates iliat iiKinds:' a poI>peptid.e of tlje present invention. 

15 la addition, innnniiocytocbamistry can be used can be used to identify and 

localize a polypeptide of the present invention on or within a cell. Fmthennore, 
in situ hybridization mettiods, sncb as FISH, can be used to idmtify and localize 
nucleic acid molecules -within a cell and hybridisation methods can be used to 
identiJfy nucleic acid molecules, either DNA or RNA for cellular preparations. 

20 Cells or cell iysates can be screened for an activity using a variety of methods. 
For exaoople, the ability of a cell or cell lysate to bind with a substrate or convert 
a substrate, including a detectably labeled substrate, can be used to detect a 
parricoiai" acti\it>' (see Haygood aiKf Davidson, 1997). 

Nucleic acid molecules of the presrait invention can encode peptides that 

25 have non~ribosomal peptide synthetase activity, have O-methyl transferase 
acti-vity, have transaminase activity, or that participate in the synthesis or 
modification of one or more ectdsascidins. Nucleic acid moleoules aacoding 
peptides having such activity cm be identified by making comparisons of nucleic 
acid sequence or translation amino acid sequences derived therefrom using 

30 aietliods known in the m% including BLAST comparisons. A nucleic acid 

molecule of the present invention can be expressed and the expression products 
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scj-eened and confimied for baviiig such activity. In addition, nucleic aoid 
moleottles of the present invetitioB cas eacode poiypqptidea that have other 
activities. Metlsods for screeniog such ac^vities are faiomi in the art. 

Tfie preseat im'mtioB also includes a ljbrar>^ of nucleic acids of the 
5 presetJt iaveation, A library of auclQC acids includes between abont two, about 
foitr, about six, about ei^t, about tea, about Uiirty, about ses^enty^ about oae- 
h»iidre4 about osfr-thousand, about ten-Uiou&and, about one-bmidred &omaad 
or about ones-miilion nucleic acid molecules and about three, about about 
seven, about twenty, about fifty, about five-huudred, about fi%4biO«saii4 about 

10 five-hundred tiious^d and about tea million nucleic acid molecules. The 
members of such a library are preferably different nucleic acid molecules, but 
that need not be the case. 

The nucleic acid molecules of the present invention can be used for a 
variety of applications, and therefore include but are not limited to, PCR primers, 

15 probes to idesttify siiailar sequences, aud coustructs for in vitro or in vivo 

transcription and/ or traoslation to make polypqjtides of the present iavention. 
The particular appHcaUoa of a nucleic acid molecule depends on a variety of 
factors, socb as ibey are known in the art, inchvde the length, straadedtjess (single 
straaded or double stranded and positive $erijse or negative sense), cheniicai 

20 characterization (such ^ DNA or RKA) or whether the nucleic acid molecule is 
detectafely labeled or not 

m. Folj^peptid^Mt That Catalysses the Syafeissis of Aa Ecteinaseidin At 
I^east in Part aod Libraries Thereof 

25 The present Invention also includes a compositicai including at least one 

polypeptide or a portion th^eof that catalyzes at least one step in the sjnthesds of 
at least one ecteiQascidin, wherein the at least one polypeptide or a portion 
thereof is preferably derived from at least one maiiae organism. 

A polypeptide of the present invention cajti be- deri\^ from at least one 

30 marine organism. A n^ie organism cm isKlude my organism that can be 
found in a marine euvitomnent, eiUier naturally or xenotypically. A marine 
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organisna can be a veitebrate, an im^ertebrate or a rniicellular orgaiusnj such as a 
fiaigi, aigae or bacteria. Preferably, a laaxine orgmisci Is an mvertebrats, such as 
mBcteinascidia such as Ectebtmcidia twbinaia^ or a umcelMar organism, such 
as a bacteria, such as a bacteria of tiie present isv^tion. 
5 The nucleic acid nioiecules of the presmt inveiitiofl can be translated to 

pro vide poi>pep{ides. These polypeptides can be substantially purified or 
purified and preferably have at least one activity of a pol>peptide tliat 
participates in the syiithesis of an ecteinascidin, such as but not limited to, anon- 
ribosomai peptide synthetase (MRPS), an O-methyl transferase, a transaminase, 
iO or any poi>'peptide that participates ia the synfliesis or modification of an 

ecteinascldiii. Assays for non-ribosomal pqptide synthetases are in Uie art 
(see for example, MaraMel et al„ 1997), and references disclosed flierein). The 
activity of the polypeptide can be screoied and confirmed iising methods known 
in ih& art, later developed or d^cribed herein, Foi- example, antibodies tliat bind 

15 with active portions or fragments of an activity cart be used to ideniii}' 
appropriate polypeptides. AJtematively, substates for an acti\'it>\ such as 
substrates that are detectably labeled, can be used to detect the binding of a 
substrate to an activity or the conversion of a substrate to a product 

A polypeptide of the present inv^ition can be of any length, but is 

20 preferably between about 10 amino acids and about 300,000 amino acids in 
lengfti and more preferably bet\v'een about 100 amino acids and about 100,000 
amino adds in lengfe or between about 1,000 amino acids and about 30,000 
amino acids in length. 

The polypeptide of the present iiivewtion cm be made using recognized. 

25 methods, such as by way of recombinant methods as they are knou'u in the art 
(see, Sambrook et al, 1 9S9; Ausubel et al, 1998) or by digesting proteins or 
polypeptides. For example, nucleic acid molecules encoding or suspected of 
encoding a polypeptide of the present invention can be cloned into expression 
vectors that are feansformed into appropriate hc^t cells where the nucleic acid 

30 molecules are expressed. The resulting polypeptides can be optionally purified 
and tiieir activity confirmed lising methods of the present invention or as they are 
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kiiouai in the art or later developed. Altemativeiy, the in v/'vo activity of 

p olypeptides can be confirmed usmg methods of fee present mveiitioa or as they 

arefcnox^Timthsart 

A polypeptide of ti>e present ;m "Ciilior, can he provided ex vivo or witbiu 
5 a Ceil. A polypeptide of the present invcjnrioii can be expressed v/Miin a cell by 
trajisfecting a ceil with a nucleic acid molecule that encodes a polypeplide of the 
prescsit inveatioa. Tho nucleic acid molecule of the preseait invmtioa can be 
oper^ly liuked to sKqjression control se^jueoces appropriate for tbe cell such that 
the nucleic acid molecule of the present invention is expressed on or wiftiin the 

1,0 cell. The nucleic acid molecule can also encode a fusion protein such that the 
fusioB protein is expressed on or within tlie cell. In tlus instance, a fusion 
proteiji that includes a detectable label as tlie polypeptide of interest can be used 
to confirm tlie presence or track the location of the fusion protein in the cell. 

A polypeptide of the present invention can also be part of a fusion protsin 

1 5 that includes a polypeptide of the pr^nt inveantion and a polypq>tide of interest. 
A polypeptide of interest cm be any polypeptide, but is preferably a detectable 
label, such as green fluorescent protein, or a seqnence that aids in the purification 
of a polypeptide* such as FLAG. A fijsion pmtein that includes a polypqjtide of 
the present invention can be made from a nucleic acid that encodes a fusion 

20 protein can be made by operabiy linking a nucleic acid that encodes a 

pol>peptide of interest with a nucleic acid that encodes a polypeptide of the 
present invention. The opearable linking can be direct or indnect, such as in the 
case vifhere a linker connects the polypeptide of ihe present invention with a 
polyp^tide of interest. The nucleic acid molecule of the present invention and 

25 the nucleic acid that encodes a polypeptide of interest are preferably operabiy 
linked in firame sudi that an oper^le polypeptide of the present havention and an 
operable polypeptide of latere^ are translated from the nucleic acid, but that 
need not be the case. The present invention also includes such fusion protsiiK or 
libraiies of such fusion proteins. 

30 The present invention also includes a library' of poi>i?eptides of the 

present invention, A library of poljpeptides of fee present invention, including 
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{usmn proteins, mclBdes between about two, about four, about six, about eight, 
atoout ten, about tJiirty, about seventy, about one-huiidred, aboiit one-tJiousand, 
about teo-thotisaQd, about ons-huadred thousand or about ojie-million 
polypeptides and about three, about five, about seven, ^out twenty, about fifty, 
5 about five-hundred, about fifty-thoasatid, about five-hundred fhousand and about 
ten radllion polypeptides. Tfie membets of such a Hbrary are preferably diiSerent 
poI>peptides, but that need not be the case. 

IV. Aiitibodses that Biad mth a Bacteria of the Present laventioa or a 

1 0 Polypeptide of the Present In veation 

The present invention also includes aatibodies that bind with or 
specificaliy bind w-ith a bacterial or polypeptide of the present invention. The 
present invesntion also includes active fi-agm^ts of such antibodies and methods 
of detecting bacteria of &e present invention and polypeptides of tJie pi^^^ent 

1 5 invention. In such metbods, the binding of an <mtibody to a target can be 

detected using direct or indirect detection methods, preferably using a detectable 
label, such as an enzyme, particle, or fluorescoit moiety such as green 
fluorescojt protein, 

20 Bacteria 

The present mvention also includes aatibodies that bind with or 
specifically bind with a bacteria of the present mvention. Such antibodies can be 
polyclonal or monoclonal and cm be made using methods known in the art (see, 
Harlowe and Lane, i98S). The antibodies can be whole antibodies of ^ty class 

25 or subclass or mixture thereof, and caji include active fragmen ts of such 

aatibodieg, such as Fab fragments made using methods kuovvn in the art. The 
specificity of such laitibodies can be screwed and confrimed using assay ibnaats 
hnmvn in the art, such as ^yrne linked icamunosoibent assays (ELISAs) or 
other appropriate immunoassay fonnats. Such antibodies are usefitl in detecting 

30 the presence and/or amount of such bacteria in a sample. 



33 



The present mvaRtion also inchides antibodies tliat bind witb or 
specificaiiy bind with a polypeptids of the present inveation. Such antibodies • 
can be polycional or monocloiml and cm be made using metliods laio\?ii in the 
5 art (see, Hariowe md Lane, 1988). The antibodies can be whole antibodies of 
any class or subcl^s or mixture thereof; and can incMe active ftBgments of 
such antibodies, sach m Fab fira^ents made using methods known in the art. 
Hie specificity of such antibodies can be screened and confirmed using assay 
fomiats known in the art, such as using enzyme linked immunosorbent assays 
1 0 (ELISAs) or otiier appropriate immunoassay foranats. Such antibodies are useful 
in detectttig the presence and/or amount of soich polypeptides in a sample, 

V. Method of Making Eeteinascidias Using Bacteria, Nnclek Acids or 
Polypeptides of the Present laventioii and Compositions Made Thereby 
15 The present invention also includes a method of making a composition 

incltiding providing at least one bacteria, nucleic acid molecule or polypeptide of 
the present invention, and synthesizing at least one ecteinascidin or precursor 
thereof 

20 Bacteria 

At Jeast one species of bacteria of the present invention can be used to 
perfoi-m at least one step in the synUiesis of an ecteinascidin. ITie ecteinascidin 
or ecteinascidin prosursor or precuisors synthesized by this method can be any 
previously kaowj or unknown ecteinascidiii, including novel ecteinascidins 

25 sya&esized by modification of tlie biosj'nthesis pathway of a naturally occurring 
ecteinascidin, and including novel compounds synthesized using nucleic acids or 
polypeptides of the presmt invention that participate in ectemascidhi synthesis. 
The bacteria may be a .member of any bacterial species, such as species 
belonging to the eiibacteria or aichaebacteria, tliat s^'nthesize an ecteiiiascidin, at 

30 least in part. Preferably, tlie bacteria are maiine bacteria, such as sj'mbioRts of 
the tunicate EcMmsdim turbinata. The bacteria are isolated irom their natural 
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growth, cnyiromiieBt such that they are cultured in the absence of their host 
organisiB, Bacteria of ths presotit mvention may or may not be traasformed wiQi 
nucleic acids from other organisms, such as other bacterial species. Polypeptides 
encoded by such nucleic aciife may participate in fhe syntiiesis of ecteinascidias, 
5 includijig ecteinascidins not iiaf arally sjoithesized by tfae cultured bacteria, 
iticltiding modified forms of mtvjrally occnmng ecteinascidins. 

Altemativfeiy, the bacteria may be novel or previously identified bacteria 
timt do not naturally s>iithesi2e ecteinascidins, but are traRslbmed with niicleio 
acids of the present invention such that expression of tbeimcleic acids allow ^hs 
10 bacteria synthesize one or more ecteinascidins or one or more ecteinascidia 
precvttsors, Ttie bacteria may be of any bacterial species, such as species 
belonging to the eubacteria or archaebacteria. la addition, novel or previously 
idesatified bacteria, such as Streptomyces lavendulae, Pseudomonasflmrescens 
and Myxococcus xanthus, tihat sj-nthesize componnds similar to ecteinascidins, 
1 5 may be transfonaed with nucleic acids of ^e present invention such that the 
bacteria produce novel compomitte. 

Isolation, p-uritiealion, and cultiiring of bacteria can be accomplished 
■using standard microbiological techniques as they are kmmix in tlAe art. Sterile 
technique can be esnployed to prevent tlie introduction of exogenous bacteria of 
20 fimgj into the preparation. Hie bacteria of interest may be isolated from E. 
turbinata by disrupting or homogenizing a preparation ofE, turbmata, 
Dismption or homogemzation may be done by crashing, grinding, slicing, or 
ruptuiiag the E, m-bmata. This may be done manually, with or without 
impiements such as homogenizers, rasasr blades, scalpels, mortar and pestle, and 
25 the like, or may he done with any of a variety of machines such as, but not 
limited to> a blender or a motorized homogsuizer. 

The dismptate or horoogenate may be filtered to remove large partxcies or 
debris from the preparation, hi this case the filter has a pore siae large enotigh to 
allow bacteria to pass throng. Centrifiigatioa may be used to sediment the 
30 bacteria. Preferably tins is done after removing large psaticles fiom the disruptate 
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or hornogenaie. Ate centrifugation, the supemat-aiit is removed and the pellet 
coatainiiig fiie bacteria may be r esaspeiidLed in. a buffer or media. 

The buffer or media to be used in lesu&pesidiag fee bacteria can be aiy 
media, but preferably is a media that promotes the growth of tbe bacteria, t he 

5 media optionaJiy coiibans a carbon source, a nitrogaa source, salts, extracts sacb 
as 3'east, beef, or liver extract, and/or yitamins. It may be desirable to itiokde 
extracts of tbe £ m-btnata host organism, in the media. Sucb extracts maybe 
taken Srom the disraptate or homogeriate superaatant, or may be fractions of an 
tnrbmuta disrapiate or homogenate obtained by one or more ceiitrifugatiom, 

10 column cbromatography steps, precipitalious, dialj'ses, or other fractiotiation or 
pimfioatiou steps. Alternatively, samples of E. tmbinata cm be processed by 
sterilization, such as by aiUoclaving, a«d the resulting preparation used in 
microbiological mediuin by the optional inciusion of a gelling agent, such s& 
pectin, agar or gelatin. Other ntttdeate be added to such medium, such as 

1 5 vitajtBins, minerals and carbon and/or nitrogen sources. Optionally, an ocean 
environment can be created by adding autoclaved or filter sea-water, or artificial 
sea water (such as histant Oceaii) to such media. 

Cultures may be obtained by plating the bacteria on solid or semi-solid 
media or by propagatittg the bacteria in liquid media and incubating the bacteria 

20 at a teraiperature, pressure and aimospberio conditions &at allows the bacteria to 
propagate. Sach cnltar© conditions may mirmr or be samiiar to those that the 
microorganism encounters in its native environment. 

Pure cnteres may be obtaised by a variety of art recognized methods, 
such as sequentially streaking colonies of bacteiia from the cnlture. Cultures 

25 enriched for a pariicidar microorganism can be obtained by culturing a sample in 
a selective or di,fferential microbiological mediimi such that the relative 
proportion of a particular microorganism is intareased relative to otlier 
microorganisms. Such enrichment procedures can be used restoratively to 
provide for erarfched cultures of varying purity. 

30 The bactma may be cultured on solid or sani-solid media or in liquid 

media, but once purijSed, die bacteria are prefsstably cultured in liquid media. If 
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the mediuiii_ is a solid or semi-solid medium, a gellilig agent; such as pectin, agar 
or geiatin is incladed. The medium optiooally includes a nitrogeQ source, salts, 
exixacts smh as ymst, bee^ or liv^ extract, aaid/or vitamins. It may be desirable 
to iaclade extracts of the E. turbinata host organism in the media. Sucb extracts 
5 may be taken from the dismptate or liomogenate superaataat, or may be jfractions 
of an E. turbinata dismptate or homogeaate obtained by one or more 
centiifugations, column oteomatograpliy steps, precipitations, diaiyses, or ofeer 
fractionation or purificatj.on steps, AUematively, samples of £. Uirbinata catibe 
processed by steriHzaiioji, soch as by autoclaviKg, and tiie resnlting preparation 

1 0 used in microbiological medium. Optionally, m ocean environment cm be 
created by adding autoclaved or filter sea-watsr, or artificial sea water (such as 
Insiant Ocean) to such media. When cultured in liquid media, tlie cultures may 
be shaken for aeration, or air may be injected into the culture through one or 
more tubes to irUioduce oxygen into the culture, if such bacteria are facultatively 

1 5 aerobic or aerobic in met^oHsm. Prefembly, the roedia used to culture sudi 
microoi-ganisms mirrors or is related to the natural environment of the 
microorganism. 

For pix>duction of ecteiaascidins, bacteria maybe- cultm-ed under 
appropriate conditions of temperature, salt, nutrients, pH, and aeration. The 

20 growth media preferably contains compounds that pramote optimal growth of 
the bacteria, such as but not limited to, salts, sugars, amino aci<fe» vitamins, 
extracts, and'or buJGforiag agents. The bacteria may be cultored under 
fermentation conditions such &at they may be grown to high density, in 
fcamentative growth, it maybe desirable to add compounds, such as sodium 

25 hydroxide, to stabilize the pH of the media as the cells grow. Alternatively, 
media may be exchanged during the period of culturing such that the growth 
media retains optimal levels of pH and. nutrients. 

Precursors of ecteinascidins, such as but not limited to amino acids, may 
be added to the bacterial media. Ecteinascidin precursors, and oiher compounds 

30 sadh as nutrients, maybe added to the culture one or more times before and/or 
aSer inoctilation of the media with bacteria. 
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Culture media may be harvested duxiftg the period of growtli for isolatioa 
of ectemascidim. Altsmatively, the bacteria, the media, or both may be 
harvested at file of the ^owth period. 

EcteinascidiRS or ecteinascjdin precmsors may be isolatM and pmfied or 

5 partially purified from the media, bacteria, or both mmg appropriate methods as 
tliey are know or developed iii the art. Such methods may include oae or more 
steps of one or more of the foUoVving; centri fugation, phase separation, 
precipitation, chromatography, dialysis- The piority of the ecteiaasciditis or 
ecteinasoidin precursors rnay be evaluated by chemical or biophysical means, 

10 such as m or 2D NMR and tlieir acti\'ity assayed by antitumor, anticancer, 

cytotoxic, antibacterial^ or antiviral assays as they are known in the art, including 
those disclosed in aS, patents 5,256,663, 5,47S,932,5,4$4,717, and 5,089.273. 

The present invention also includes compounds made or identified using 
the present inveniion, For example, Hie present invention inchides ecteiaascidins 

1 5 made using at le^^t one mefibod of the present inv^tion . A compound made or 
identified using a method of the present invention can be a novel or non-novel 
compound. For example, a compound of the present invention can include a 
compound that was not or was not novel on the date of the filing of Ihe present 
application, or one year or six months prior to the filing date of the present 

20 application, 

A cojBpoimd of the presexxt invention can be provided with at least one 
phannacentically acceptable carrier as they are known in the art and discussed 
herein. Such pharmacenticaiiy acceptable carriers are toown in the art and are 
disclosed herein, A compound of the present invention can also be a 
25 phannaceatical composition. 

Nucleic Add Molecules an d PoIvpevUdes 

At least one nucleic acid molecule of the present invention or at least one 
, polypeptide of th e present invention can be expressed and used m a system to 
30 synthesize an ectemascidin. The ectdaasoidin or ptecnrsors thereof can be 
previously known or unknown ecteinasoidin or precniBors thereof. 
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The present invmtion atili?.es at least one nucleic acid niolecale of ttie 
present jnyeiition and/or at least one poi>rpeptjde of the present invmtion in such 
methods to make teowii or novel ecteinascidms or related compomds, iiiciudiiig 
Et-743. The present invmition can utilize at least one nucleic aoid molecule of 
5 the present invention and/or at least one polypeptide of the present kveniion 
alone or in coinbination with o&er g^es. These polypeptides and genes can be 
known or later developed and can be derived from marine, aquatic or terrestrial 
organisms. 

For example, some methods of sjntbesidng ecteinascidins may pro'vide 

10 cassettes that include a gene complex that encodes at least one eeteinascMin 
synthesis activit>'^ such as a non-ribosomal peptide sviithetass activity, either in 
whole or in part. Nmcleic acid molecules of the present iiwention can be inserted 
into such cassettes midomly or non-raadomly, including replacing identijSed 
gcaies. Wh&o. nucleic add molecuies of the present invention are inserted non- 

1 5 randomly into such cassetteSj they can be inseated in-frame to replace existing 
genes that encode polypeptides that have functions shnilar to Uie polj'pqjtide 
encoded by a nucleic acid of the present invention. The nucleic acid molecules 
of the present invention are thus expressed as polypeptides of the present 
im-eiitioa which can act as part of a g^e complex to produce kaovm or novel 

20 compounds, such as ecteinascidins. 

Cells or extracts thereof (such as substantially purified exfe-acts) tliat 
include one or more of the nucleic add molecules of tiie present invention or one 
or more polypeptides of the pi^esent invaation can be used to synthesize a wide 
variety of ecteinascidins including Et-743. Such cells or extracts thereof can be 

25 contactetl with a variety of compounds, mcluding substrates 3for an ecteinascidia 
syafcesis activiiy, particulariy an ectdaascidin synthesis acSvity present in the 
cells or extracts feereof. Polypeptide expressed &om nucleic acids of the 
present invention can act on these compounds in order to make a ^-ide variety of 
ecteinascidins including Et-743. hi one aspect of the present invention, more 

30 than one cell and/or exfa-act thei^eof can be used in combination or sequentially 
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such tiiat tbe products made by combinatjoii of ceEs or extracts can be 
determiaed and its activity confiraied. 

Alternatively, polypeptides expressed &om nucleic acids of the present 
itivsijtion can beprodiTced and purified or substanlialiy purified for use in the 
5 maaufacture of one or more ecteinascidins, one or more ecteinascidin precursoi^j 
or one or more sctdnascidiB related compoimds, ixicludrag novel compoimds. 
The polypeptides may bs produced by expression of fee lujcleic acids of the 
pr^ent invention in any suitable system, including in vitro iranscriptioa aad 
translation systems and in viva expression systems. For example^ a polypeptide 

10 of the present invention can be expressed wiiiiin a cell by teansfecting a cell with 
a nucleic acid molecule that encodes a polypeptide of the pr^eait invention. The 
nucleic acid molecnle of iJie present invention can be operably linked to 
expression control sequencers appropriate for the cell such that the polypeptide of 
the present iBvention is expressed on or within the cell, or the pohpeptide of the 

15 present invention tliat is expr«sjsed witliin a cell is secreted from the cell. Cells 
for expression of a pol vpeptide of tiie present irweation include but are not 
limited to bacterial cells, such as E. coli or Bacillus ceils, yeast cells, such as 
Saccharomyces cerevisiae, Pichia pastom^ mdPicfua tneihoUca, eukaryotic 
cells, such as plant cells, insect ceils, or mammaHaa cells including without 

20 limitation COS-1 cells (ATCC No, CEL 1650), COS~7 cells (ATCC No. CRL 
1651), BHK cells (ATCC No. CRjLi632), BHK 570 ceUs (ATCC No. CRL 
10314), 293 ceils (ATCC No, CRL 1573) and CHO-Kl cells (ATCC No. CCL 
61). Transformed or transfected cells axn be cultured according to conventional 
procedures in a culture mediiun containing nutrients and other components 

25 required for the growtli of the psrticidar host cells, and preferably, tor the 
selection of cells containing the exogenously added nucleic acid. 

A polypeptide of the present invmtion may be purified from the cell 
culture using well-established methods of protein pmificationj preferably 
including affhiity purification (see, for example, Scopes, 1994; 19S8). 

30 The present invention also includes compounds made or identified using 

the presmt invention. For example, the present invention includes ecteinascidins 



40 



wo 



FCT/lJS«2/e3703 



made using at least one method of tie preseat inveaticKQ, A compoimd inacle or 
identified using a method of the present iaveation can be a Bovel or non-novel 
confound. For example, a compomid of tJie present invention can mciude a 
compoimd that was or was not -novel on the date of the filing of the present 
5 application, or one year or six months prior to ihe filing dates of the present 
application. 

A cojnpound of tiie present invention can be provided with at least om 
phannacentjcally acceptable carrier as they are imown iii the art and discussed 
herein. Such pharmaceutically acceptable caniers are known in the art: and are 
1 0 disclosed hemxL A compound of die present invairtioii cm also be a 
pharmacoatical composition. 

VI. Method of IdetttJfyittg Nucleic Add Molecules, Nucleic Acid 
Molecules Ide»tified Thereby a»d Libraries of Nttcleic Acid Molecoies 

15 The present invmlion also includes a method for idsntif^'ing at least one 

nucleic acid molecide encoding at leasl one ecteinascithn s^iithesis act^^'ity 
including contacting a nucleic acid molecule of tlie present invention with a 
sample, and identifying nucleic acid molecule in said sample that hybridize with 
said nucleic acid molscule of the present inventioa. This aspect of the present 

20 invention ulilkes nucleic acid molecules of the pres^at invention as probes or 
PCK. primers in order to identify nuoieio acid molecules that have or are expected 
to encode polypeptides that have ecteiii««cidiii synthesis activity. 

Samples for use in the present invention can be from any source that can 
include a nucleic acid molecule^ but preferably include samples fi-om an 

25 environmental sample, such as the marine eawoiunent. The samples can 
include mariiie organisms, including uwertebrates or vertebrates or any other 
marine organism. Prefsrabiy, the sample includes single ceEed organisms, such 
^ bacteria. More preferably, the san^le includes samples that are expected to 
contain an ecteinascidia. Such samples include, for example, Ecteinascidia 

30 species, including Ecteinascidia turbinata. 
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When used as probes, nucleic acid molecules of the presmt mventioii 
can be detectably labeled and contacted with a sample. Nucleic acid molecules 
tJiat bind ^\ ith tho uucluv '.mu li- e p. ."scni m\ cnnon can bt- idcnufied, cloned 
and scquynLSti usmg method > Li^.Dv-ii m the ait, 1 he idcuhtted nuclejc acid 
5 molecu'es can be operably linked to sxpre£>siun contiol ^tfqL1euccs such that a 
polypeptide eucoded by the identified nucleic acid molecule can be mads and 
characterized. 

When used as PCR primers, the nucleic acid molecules of the present 
iiwentioii can be used to amplify nucleic molecules in a sample. Oae or more 

1 0 than one of ihe primers used ia one or more amplification reactions can be a 
nucleic acid laolecule of the present iBvmtioH- The amphlied nucleic acid 
moiocxiles are presumptiveiy derived &om a gene encoding an eoteinascidin 
syntliesxs activity. The amplified nucleic acid molecules can be identified, 
cloned and sequenced using methods known in the mi. The ampliified nucleic 

1 5 acid itioiecules can be operably linked to expression control sequences such that 
a polypeptide encoded by the amplified nucleic acid molecule can be made and 
characterized. 

The nucleic acid molecules of the present invention can also be used to 
identify nucleic acid moiecules that are upstream or down^eatn from a targeted 

20 segment ofa gene encoding an ecteinascidinsyjithesisactivit^f. A nucleic acid 
molecule that encodes a consented region of a gene encoding an ecteinascidin 
synthesis activity. A nucleic acid molecule of the present invention can be used 
in primer extension or inverse PCR methods such that upstream or downstream 
segments from the point of hybridigation are identified. These sjctended 

25 segments can be idmtified, cloned md setpienoed using methods known in the 
art The extended se^nents can be operably linked to expression control 
sequences such that a poljpeptide eDooded by the extended segment can be made 
and characterized. 

Whether the nucleic acid molecule of the present invention is used as a 

30 probe, primer or PCR primer, tlie methods of the present invention identify 
nucleic acid moiecuies that presumptively encode at least a portion of a gene 
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eaccxJing m ecteiuaseidin sytsiiiesis aemity. Any of these processes cm. be used 
alone, in combination or restoratively to idaatiiy at le^t portions of genes 
encoding an ectemascidin synthesis activity in a sample. 

The presmt mventioa ako inctodes nucldc add molecules 
5 the present invention, Th& identified nucleic acid moiecules can include 
expression control sequences operably linked to the identified nncieic acid 
tnoiecnles. Such constructs cm. be used to make po3>peptides encoded by the 
identified nucleic acid molecules and ib& polypeptides can be cliaractsrizsd as to 
a variety of stmctures and functioi^, paxticalaiiy structures and fimctians 
10 associated wiili genes encoding an ectein^oidin synthesis activity. Tlie present 
invention includes a lilsrary of nucleic acids, cells or polypeptides identified by 
the present invention, 

VH. Method of Identifying a Bioactive Compound, Bioactive Compoands, 

15 and Therapeutic Gompositions 

The present invention also includes a metliod for iden^^^yiI^g a bioactive 
compound, such ss those made or identified using a bacteria, nucleic acid 
molecule or polypeptide of the presoit invention, including contacting a 
componnd made or identified by the present inveation with at least one in vitro, 

20 ex vivo or in vivo assay system and detemming the bioactivity of said 

componnd. The present invaation includes bioactive coir^otmds identified 
i^ing tliis method. The identified bioactive compounds can. be provided in a 
phaimaceuticaliy acceptable canier and cm be a pharmaceutical compound. 
In vitro, sx vivo and in vivo systems used in the present iiivexitioa are 

25 preferably those known in the art for a bioactivity to be identified. The assay 
chosen to be used in this me&od is related to a bioactivity that is being screened 
tor. Preferred systems include fliose that determine at least one ecteinascidin 
activity. For example, in vitro systems (systems that do not use whole organisms 
or whole cells) and ex vivo systems (systems that use whole cells or portions of 

30 cells) fox the identification of ecteinascidin activity are known in the art, 

including fliose disclosed in U.S. patente 5,256,663, 5,478,932,5,484 j 17, and 
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5,089,273, In vivo systos (systems that use whole organisms or tissues or 
organs derived therefrom) also ktiown m the art (see, for example, Valoti et 
aL, 1998). Compounds are identified iismg these metiiods as having a desired 
bioactivity . Compounds identified by feese methods are bioactive eompooiids 
5 that have at least one bioactivity. 

Screeinns ofcomvounds foracttvities 

The following assays can be performed to confirm the bioactivity of a 
compound: 

10 a) antimicrobial eiFect on S. aureus by placing a compound on a paper 

disk and det^mimng iiie ability of the compound to inhibit the g^o^v^h of the 
aureiLs (Benson, 1994). The results of this assay establish the toxicity of the 
compoimd towards Gram-^wsitive bacteria. 

b) aiitimicrobiai effect on £. coli by placing a compound on a paper 

1 5 disk and determhnng the ability of the compound to inhibit the growth of the E. 
coli (Benson, 1994). The results of this assay establish the toxicity of a 
componnd towards Gram-negaiive bacteria. 

o) antimicrobial effect on Candida albicans by placing a compoimd on 
a paper disk and detetmining the abili^' of the extract to inhibit the growfli of 

20 Candida albicans (Benson, 1994). Ihe results of this study establish the toxicity 
of compounds towards >'easts and fungi. 

d) iniiibition of the giw\th of cancer cells by contacting a compound 
with the National Cancer Insti tute's (HCI) cell line screen (approximately sixty 
ccH iiaes) against up to fifty-one cancer cell types in vitro (Boyd et al., 1995). 

25 These results pro\'ide an activity profile for the coHqjonnd, The activity profiles 
. of an extract can be compared to the activity profiles of other samples in the NCI 
datab^ of activity profiles. Similar activity profile of different extracts, 
iiicluding biown extracts wife known modes of action, strongly suggests that the 
samples have simiiar mod^ of action. A novel activity profile strongly suggests 

3 0 that the compound has a novel mechanism of action. 
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e) cytotoxic activity of a compound can be detemmed by a variety of 
methods, mcluding inbifoMon of brine shrimp by contacting tp,fenty-four liour old 
brine shrimp .nauplii for iwenty-foiir hours with an compound and observing tlie 
inhibition of the activity or viability of the brine shrimp. The results of this 
5 assay estabiish the cytotoxicity of the compound towards whole organisms. 

Compounds identified as having a bioac^vity have presximptive 
tha^eutic activity. Such therapeutic activity and related phanjiacologicai 
parameters cm be confiraied using the methods discussed herein. 

Pter;?Mgo/6'£;>' gyid toxicity qfbioa ctive ccnpomds and bioactmties 

The structure of a bioactive compound or bioactivity can. be determined 
or coiiilnned byBietbods k-nown in the art. such as mass spectroscopy. For 
bioactive compomids and bioactivities stored for extended pmods of time under 
a variety of conditioss, fiie structure, activity and potency thereof can be 
15 confirmed. 

Identified bioactive coaqjounds and bioactivities can be evaluated for a 
particular activity using recognized methods and those disclosed herein. For 
exaniple, if an idtaitified bioactive compound or bioactivity is found to have 

ajiticaiicer celi activity in vitro, then the bioactive compound or bioactivity 
20 would have presumptiA^e pharmacological properties as a cheiBOtherapeutic to 
treat cancer. Sucii nexuses are known in. the art for several disease statesj, and 
more are expected to be discovered over time. Based on such nexuses, 
appropriate confirroatory in vitro and in vivo models of pharmacologicai acMty 
and toxicology can be selected and a^opriate tests pert>imed. The methods 
25 destcribed herein can also be used to assess pfoaimacological selectivity and 
specificity, and toxicity. 

Identified bioactive compoirads and bioacti\dtjes can be evaluated for 
tDxicological effects using kno-^^u methods (see, Lu, 1985; U.S. Patent Nos; 
5,196,313 to Cuibreth (issued March 23, 1993) and 5,567,952 to Benet (issued 
30 October 22 , 1 996)), For example, toxicology of a bioactive compound or 

bioactivity can be estabHshed by dstfimining in vitro toxicity towards a cell line. 



45 



sucb as a manimaliaii^ for exaitple huniaai, cell lim. Bioaciive compoimds and 
bioaotivitie-s can be treated wilii, for example, tissue extracts, such as 
preparations of Hver, sach as microsomal prepamtioiis, to iieiermine 3!5crc;ased or 
deci-eased toxicoiogical p-opexties of the bioactive compound of bioacti vit}' after 
5 being metabolized by a whole organism. .The remits of these types of studies are 
predictive of toxicoiogical properties of cbemicais in animals, such as mammals, 
includittg liiBtnans. 

Altemativeiy, or in addition to these m vitro sftudies, tlie toxicoiogical 
properties of a bioactive compound or bioactivity in an animal model, such as 

iO mice, rats, rabbits, dogs or monkeys, can be detenBined using established 

methods (see, Lu, 1985; and Creasey, 1979), Depending on the toxicity, target 
brgaa, tissue, locus and presumptive inechamsm of fee bioactive compound or 
bioactivity, the skilled artisan would not be burdened to determine appropriate 
doses, IDse values,, routes of admimstration and re^es Ihat would be 

1 5 appropriate to detsnnine the toxicoiogical properties of the bioactivg compound 
orbioactivity. ti addition to animal models, bnman clinical trials can be 
performed following established procedures, such as feose set forth by the 
United Static Food and Dmg Administration (USFDA) or equivalents of other 
governments, These toxicitjr studies provide tlie basis for determining the 

20 efficacy of a bioactive compound or bioactivity in vivo. 
BMcucv of bioactive comt>ounds and hioactivities 

Efficacy of a bioactive compound or bioaclivity can be established using 
sevesrai art recognized methoifej such ^ in vitro methods, animal models or 
human clinical trials (see, Cnsasey, 1979), Recognized in vitro models exist for 

25 several diseases or conditions. For example, the ability of a compoimd or 

composition to extend the life-span of I:W-infected cells in vitro is recogmzed 
as an acceptable model to identify chemicals expected to be efficac}ou,s to treat 

infection or AIDS (see, Dakge et al, 1995). Furthermore, the ability of 
cyclosporin A (CsA) to prevent proliferation of T-cells in vitro has been 

30 established as an acc^table model to identify chemicals expected to be 

efScacious as immunosuppressants (see, Snthanthiran et al,, 1996). For nearly 
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every class of thei'^eutic, disease ox coiiditioji, aa acc^table in vitro or animal 
model is available. In addition, &ese in vitro jnetJiods can use tissue extracte, 
sach as preparations of iiver, such as microsomal preparations, to provide a 
reliable indication of the effects of metabolism on a bioactive compoisnd or 
5 bioactivity, SiTmlarly, acceptable asiimal iBOdels can be usai to establisb 
efficacy of bioactive cofnpoiiTids md bioactivities to treat vaiioiis diseases or 
conditions. For example, the rabbit knee is an accepted model for testing agents 
for efficacy iii treating artliritis (see, Shaw aiid Lacy, 1973). Hydrocortisoce, 
which is ^proved for use in humans to tre^t arthritis, is efficacious in tltis model 

10 which con&ms the validity of this model (see, McDonough, 1982), When 
choosing an appropriate model to determine efficacy of bioactive compovmds 
md bioactivities, the skilled artisan can be guided by the state of the art to 
choose an appropriate model, doses and route of administration, regime and 
endpoint and as such would not be unduly burdened. 

15 In addition to aniinal Bvodels, luiman clinical irial s can. be used to 

detennine the efScacy of bioactive compoimds aad bioactivities. The USFDA, 
or equivalent govenrtnental agencies, have established procedures for such 
stadies. 

Selectivity of bioactive compomds mid bioactivities 

20 The in vitro and in vivo methods described above also establish the 

selectivity of a candidate modalator. It is reco^zed that chemicals can 
modulate a wide variety of biological processes or can be selective. Panels of 
cells as tliey are known in the art can be used to detennine &e specificity of the a 
bioactive compound or bioactivity (WO 98/13353 to Whitney et al., published 

25 April 2s 1998). Selecti^'ity is evident, for exasnple, in the field of chemotherapy, 
where the selectivitj? of a chemical to be toxic towards cancerous ceils, but not 
towar(fe non-cancerous cells, is obviously desirable. Selective modulators are 
preferable because tliey have few&c side effects in the clinical setting. The 
selectivity of a bioactive compound or bioactivity can be established in vitro by 

30 testing the toxicity and effect of a bioactive compound or bioacti vity can be 
established in vitro by testing the toxicity and effect of a bioactive compound or 



47 



wo 02/064843 



bioactivity on a plurality of cell lines tliat exhibit a variety of cellular path ways 
and sensitivities. The data obtained form these in vitro toxicity studies cart be 
extended to anifiial .model studies, indiidiag bumaii climcai trials, to detemdne 
toxicity^ efficacy and selectivity of abioactive compound or Moactivity. 
5 Hie selectivity, specificity a»d toxicology, 8S well ^ the genei-al 

phannacology, of a bioactive compound or bioactivity can be often improved by 
gmerating additional test chemicals based on the structure/property relatioaship 
of a bioactive componnd or bioactivity originally identified as having activity, 
Bioactive compooRds and bioactivities can be modified to mipxove various 

10 properties, such as affinity, life-time in blood, toxicology, specificity and 
membrane permeability. Such refined bioactive cotripounds and bioactivities 
cm be subjected to addifional assays as they are kiiown in the art or described 
herein. Mi^ods for generating and an^yzing s«ch compounds or compositions 
are known in the art^ such as U.S. Patent No. 5,574,656 to Agrafiotis et al. 

15 Pharmaceutical c(^mpositions 

The pi-esent invention also aacompasses a bioactive compoimd or 
bioactivity in a phamiaceutical composition comprising a pharmaceutically 
acceptable carrier prepared for storage and preferably subsequent administration, , 
which have a phartnaceuticaUy effective amount of the bioactive compound or 

20 bioactivity in a pharmacentically acc^table carrier or dilumt Acceptable 

carriers or diluents for therapeutic use are well known in the pharmaceutical art, 
and are described, for example, in Remington's Pharmaceutical Sciences, Mack 
Publishing Co., (A.R. Geimaro edit (1985)). Preservatives^ stabilizers, dyes and 
even flavoring agents can be provided in ihe pharmaceutical composition. For 

25 example^ sodium beuzoate, sorbic acid and esters of p-hydroxybeczoic acid can 
be added as preservatives. In addition^ antioxidants and suspending agents can 
be used. 

The bioactive compounds.^d bioactivities of the present hivention can 
be formulated and used as tablets, capsules or elixire for oral administration; 
30 suppositories for rectal administration; sterile solutions, suspensions or 
injectable administration; and &e like. Mjectables can be prepared in 
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coxwentional fonns either as Hquid solutions or suspensions, solid forais suitable 
for solution or suspatision in liquid prior to iajection, or as emulsions. Suitable 
excipisiits are, for example, water, saline, dextrose, maimiiol, lactose, lecitiun, 
albiimii), sodhim glutsanate, cysteine hydrocMoride and the Mke. In addition, if 
5 desired, injectable phamiaceutical coinpositioBs cm contain xniiior amounts 
of nontoxic auxiliary substances, sucJi as wetting agents, pH b ufferiiig agents and 
tie Mke. If desired, absorption enhancing preparation, sacb as Eposomes, can be 
used. 

The phamiaceutically effective amount of a bioactive compound or 

10 bioactivity required as a dose will dep^ on tiie route of adniinistratioa, the type 
of animal or patient being treated, and the ph>^icaJ characteristics of the specific 
animal imder consideration. The dose can be tailored to achieve a desired el&ct, 
but will depend on such factoira as weight, diet, concuirent medication and other 
fectors which those sldlled in the medical arts will recognise, to practicing the 

1 5 methods of the present invention, tlie pharmaceutical compositions can be iised 
alone or in combination witli one another, or in combination with othea: 
therapeutic or diagnostic agents. Th^e products can be utilized in vivo, 
preferably in a mammalian patient, preferably ia a huniaai, or m vitro. & 
employing them in vivo, tbephamaceutical compositions caiibe administered to 

20 the patient in a Yssi^ty of ways, including parenterally, intravenously, 
Stibcutaneously, intramusculjiriy, colonically, rectaliy, nasally or 
intraperitoneaOy, emplo>iiig a variet\' of dosage fonns, Such methods can a^so 
be used in testing the activit>' of bioactive compounds or bioactivities in vivo. 
As will be readily ^parent to one skilled in tiie art, tlie usefid in vivo 

25 dosage to be administered and the particuiar mode of administration wiU vary 
depending upon the age, wei^t and type of patiraat bein^ treated, the particular 
pharmaceuticaj compoation employed, and fee specific use for which the 
pbaimaceutical composition is employed. The detennination of eiffective dosage 
levels^ that is the dose levels necessary to achieve the desired result, can be 

30 accompfished by one sIdUed in the art using routine methods as discussed above. 
Typically, human clinical applications of products are commenced at lower 
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dosage levels, with dosage Im^el bemg mereased until the dssired effect is 
achieved. AKemativelyj, acceptable in viti'o stu4ies can be used to establish 
aseM doses md routes of administration of the bioactive compoiind$ and 
Moactivities. 

5 Bi naa-human ^mai studies, applications of the phaimaceatical 

compositions are commenced at higher dose levefe, with tlie dosage being 
deoeased until fee desii-ed ef^t is no longer achie\fed or adverse side effects are 
reduced or disappear. The dosage for the bioactive compounds and bioactivities 
of the present jnven^on can range broadly dspeading upon tlie desired affects, 

10 the therapeuiic indication, route of adiBinistration and purity and activity of the 
bioactive compound or bioactivi.ty. T^picallyj dosages can be between about 1 
ng.'lcg and about 10 mg/'kg, preferal^ly between abo-ut 10 ng^g aiid about I 
mg%, more prefearably between about 100 ng/lsg and aboyt 100 microgranis/kg, 
and most preferably between about 1 missrogram/kg and about 10 

15 micrograms/kg. 

The exact formulation, route of administration and dosage can be chosen 
by the individual physician in view of the patient's condition (see, Fingle et al,, 
in lh& Phamiacological Basis of Therapeutics (1975)). It should be noted that 
the attemfeg physician would jaiow how to and when to terminate, inteimpt or 

20 adjust ateiinistiation due to toxicity, organ dy^sfunction or otiier adverse eiTects, 
Conver.wiy, the attending physician would also know to adjust treatment to 
higiier levels if the clinical response were not adequate. The magnitude of ati 
admimstratsd does m the management of the disord^ of interest will \'aiy with 
the severity of the condition to be treated and to the route of adimmstration. The 

25 severity of the condition may, for exampie, be evaluated, in part, by standard 
prognostic evaluation methods. Further, the dose and perhaps dose frequesicy, 
mil also vai>' according to &e age, body weight and response of the individual 
patiient, including those lor veterinary appHcatioiis. 

Depending on the specific conditions being tj-eated, such phammceutical 

30 compositsOES can be formulated and administered sj'stemicaily or locally. 
Tedmiqties for fomation and administration can be found in Remington's 
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Pharmaceutical Sciences, IStfe Ed., Mack PublisJakg Co., Easton, PA (1990). 
Suitable routes of administi-ation can include oral, n^al, rectal, trausdsTmal, otic, 
ocular, vaginal, ti-ansniacosai or ix^testiBal adnainisixation; parenteral delivery, 
including intrmjiuscular, subcMaxieous, intraiuedullary injections, as well as 
5 intrafliecal, direct intraventricuiar, intravenous, iatrapaitoneal, intranasal, or 
ilitraocalar irtjsetions. 

For injection, the phannaeeutieal compositions of tlie present iKvennon 
can be formulated in aqaeous solutions, prefer^ly in piiysiologically compatible 
buffesfs such as Hanks' solution, Kinger^s solution or physiological saline buffer. 

10 Fox such ti^ansinucosal admimstratioii, penetrans Expropriate to the banrier to be 
pmneated are tised m tlie fonrndation. Such penetrans are geneicaily known in 
the- art. Use of phamiaceatically acceptable carriers to formulate tbe 
phan-naceutical compositioBS herein disclosed for the practice of rbe invention 
Into dosages milable for sj'steinic administration is within the scope of the 

1 5 invention. With proper choice of carri^ aiid suitable mauufactaring practice, the 
compositions of the present invention, in particular, titose formulation as 
solutions, can be adiuinistered parenterally, such as by intravenous injection. 
The pharmaceutical composittoas can be formulated readily using 
phannaceuticaliy acceptable carriers well known in fee art into dosages suit^le 

20 for oral administrstioas. Such carriers enable the bioactive cotnpounds and 
bioactivities of the invention to be fonmilated as tables, pills, capsules, liquids, 
gels, synips, slurries, suspensions and the like, for oral ingestion by a patient to 
be treated. 

Agents intended to be administered hitracellularly may be administered 
25 using techniqaes well kaowa to those of ordinary skill in the art For example, 
^h agents maybe encs^sulatsd into liposomes, tiiien adnunirtered as described 
above, Substantialiy all molecules pr^ent in an aqueous solution at the time of 
Hposome formation are incorporated into or witJiin the Hposomes thus formed. 
The liposomal contents are both protected irom the eternal micro-environment 
30 and, bec^s© liposomes fose will ceil mmbranes, ate efficiently delivexed into 
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the cell cytoplasm. Additionalty, das to their hydropliobicity, sisall organic 
molecules cm be directly administered iinracellularly. 

Pharmaceatical compositioBs suitable for use iii tlie present inventioii 
include compositions wherein fee acti ve ingredieats are contained in an effective 
5 amount to achieve its intended piupose. Determination of the effective amount 
of a pharmaceutical composition is well within fee capability of those skilled in 
the art, especially in light of the detailed disclosure provided herein. In addition 
to the active ingt-edieBts, these phaimacenttc-a} compositions can contain suitable 
pharKiaceuticaliy acceptable carriers comprising excipients md auxiliaries which 

10 facilitate processing of the active chemicals into preparations which can. be used 
pharmaceuticaily. The preparations fonnulated for oral administTation may be in 
the form of tables, dragees, capsules or solutions. The pharmaceuticai 
compositions of the present invention can be manufactured in a mannear that is 
itself knoftfn, for example by means of conventional mixing, dissolving, 

15 granulating, dragee-making, emulsi^g, aicapsulating, entrapping or 
lyopMlizhig processes, Pharmaceutical formulations for parenteral 
administeatios include aqueous solutions of active chemicals in water-soluble 
foim, 

Additionally, suspensions of the active chemicals may be prepared as 
20 appropriate oily injection suspeasioiK. Suitable lipophilic solvents Oi- velricles 
include fatty oils such as sesame oil, or synthetic fatty acid estere, stich as etiiyi 
oleate or triglycerides or liposomes. Aqueous injection suspensions may contain 
substances what increase the viscosity of the suspension, such as sodium 
carboxymethj^ cellulose, soibitol or dextran, Opdonaily, the su^ension can also 
25 contain suitable stabilizeiB or agents that increase the solubility of the chemicals 
to allow for *e preparation of highly concentrated solutions. 

Pharmaceutical compositions for oral use can be obtained by combining 
tlie active chemicals witli sohd excipient, optionally grinding a resulting mixtm-e, 
and processing the mixture of ^anules, after adding suitable auxiliaries, if 
30 desired, to obtain tables or dragee cores. Suitable excipients are, in particular, 
fillers such ^ sugars, including lactose, sucrose, mannitol or sorbitol; cellulose 
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preparations sxicb as, for example, maize starch, wheat starch, rice starch, potato 
starcli, gelatin, gum tragacmith, methyl cellulose, hydroxypropylmethyl- 
cellBlose, sodium carboxyTnetiyiceriulose aiid/'or poIyvinylpjTTolidone, If 
desired, disintegrating agcxits can be added, such as the cross-linked poiy^'kyl 
5 pyrolidOBs, agar, algiaic acid or a salt thereof such as sodium alginate. Dragee 
cores can be- provided with soitable coatings, Dyos or pigments can be added to 
the tablets or dragee coatings for ideiitillcation or to ch^acterize different 
combinations of active doses. 

The bioactive compounds andbioactiivities of the present iaivoition, and 

10 pharmaceutical compositions that include such bioactive compoimds and 

bioactlvities are useful for treating a varieiy of ailiBents in a patient, including a 
human As set forth in the Examples, the bioactive compounds aad bioactivities' 
of the present invention have antibacterial, antimicrobial, antiviral, anticancer 
cell, arititimtor and cytotoxic activt^. A patient in need of such treatment can be 

1 5 provided a bioactive conipt>und or bioacti^'ity of the present invention, preferably 
in a pharmacological composition in an effective amonnt to reduce th.e number 
or growth rate of bacteria, microbes, cancer cells or mmor cells in said patent, or 
to reduce the infecti\'ity of viruses in said patient The amount, dosage, route of 
adrainistratioa, regime and endpoint can all be determined using the procedures 
20 described herein. 

Tlie invention will now be described by the following non-limiting 
exampl&s. 

25 idestilicatioaofBacteria Associated With 

BetemasddUt turbimta by in situ Hybridfeatton 

Larvae of i". turhinata for in siiu hybridization are fixed in 4% 
paraformaldehyde-0. 1 M MOPS (pH 7,4V0.5 M NaCL Fixed larvae are either 
used iminediateiy or stored in 70% ethanol at -20''C. The fixed lai^ae are treated 
30 with proteinase K {Boehringei- Mannheim), refked in 4% parafomaldelij'de, 
treated with acetic anhydride in 0.1 M tiiethanolanjine, waslied with buff^, and 
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incnUtad in h>%ridi2ation buffer (5X SSQ 0 J% Tween 20, 5 mM BDTA. md 
IX Dediardt's solution) at SS^'C for one- hour, Phiorescem-labded probes are 
added in fresh iiybridization bul&r at a concentration of 2 ng per microliter. The 
samples are heated to 55*C for five to ten minutes mid then incubated at 43 °C for 
5 4 to 6 hours. Each wash is ctmduotsd twice for 1 5 minutes as follows; high 
strmgency wash 1 is 10% foimaiatde, 5X SSG, aad 1% sodium dodecyi sulfate, 
used at wash soiutioii 2 is 10% formamide, 2X SSC, and 1% sodium 
dodecyi sulfate, used ai ZTC^ and wash solution 3 is 10% formamide, 2.X SSC, 
and 0.1% Tween 20, used at room temperature. The larvae are suspended iii 

10 Vectasheild (Vector) mounting media and viewed under epifluoresceiice. The 
probes used for hj'bridiaation are designed to hybridise to regions of tiie SSU 
rRNA gens (Haygood arid Davidson, 1997) and include the universal eubacterial 
probe BUB 338 (SEQ ID NO:4) (Amann et al., 1997), tlie E, palauemis SSU 
rKNA probe 1 247r (5'-AAGTAGCGGCCCTTTGTC-3') {SEQ ID N0:1), and 

15 the seriula probe Bnl253r (SEQ H) N0:9) {Haygood and Davidson, 1 997). 

Examffle.2 

Extraction of Nucleic Acid Molecules from Samples of 
E, tiirbinaUi that lacliide Bacteria 
20 Uuless othea-wdse noted, molecular biology procedures used are standard 

techniques used m die field (Sambrook et al., 1989; Ausubel et al, 1998). 
DNA extraction 

Samples of either adult or larval E. turbinata were obtained ftom the 
waters of the Florida Keys. Free-swimming larvae were collected in the lah fiom 
25 tm-hinata adult colonies. Tarval TMk was ^hracted as previously described 

'for B. nerituta (Haygood and Davidson, 1997). Larvae w^e concentrated in 1.5 
irdliiHter microcentrifuge tubes (approximately 25 mg of 1ar\'^ae per tube) by 
gentle centrifogation and then rinsed fonr times in filtered seawater (0.2 
micrometer pore size filter) to minimize the contaminating seawater bacteri a. 
30 Excess water was removed md the pellets were ftozen at -SO^C for latea- use. 
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DHA was extracted from the E. turbinata larvfae itskg a QL'\anip Tissue Kit as 
directed by &e maniifactarer (Qiagsn Inc., Valencia, CA). 

Examjpte 3 

5 Identificattoa of Geaes in Samples of JE. turbbmta 

tfaat ladude Bacteria 

Primer d^im 

Degeaerate PCR primers are designed based on adenylation domaiRS of 
non-ribosomal peptide synfiietase (3SIRPS) genes. PCR primers are designed as 

1 0 follows: 1) tjsiag published degenerate sequences for tbe A-domains, and 2) 

osirig published safraaiycin MXl sequences as a basis fox more specific primers , 
The adenylation domain is selected as the first target because it is tlie most 
iiighly conseiYed region of the MR.PS genes. la addition, it is the loadiiig domain 
for aaniiio acids and thus serves as the basis for amino acid selectivity in the 

1 5 NRPS g^es- Recently, it has been reported tliat specific amino acids within this 
domain can be decoded to predict which amino add is eaicoded by the domain. 
Therefore, compariison of isolated A-domain sequences with those in published 
databases will provide added evidence that tire genes ai-e encoding enaymes in 
the ecteinascidin pathway. 

20 

PCR conditions for amplificatioti nfDNA f rpmE. turhinata 

The PCR conditions for toe initial amplification of the genes from £. 
turbinata DNA can be as follows, A to^ reaction volume of 50 microliters 
contains approximately 100 ng of ^. turbinata DMA (either adult or larval), 1 

25 miopomolar of each of two degenerate primers, and Taq poiyinerase arid buffer 
(Boehringer Mannheim Corp., MdianapoHs, M). A PCR protocol is optimized 
for the degenerate ISiRPS primers. The c>'cls conditions start with a "touch down- 
sequence, which lowers the annealing temperature &om 60 to 40''C at a rate of 
2°C per cycle (1 1 cycles), and v/ere then maintains at 40°C for a total of 51 

30 cyx^ies. Cycle steps are as follows: denaturation (94*^0; 1 minute), annealing 
(60°C to 40'^C; 2 minutes), and extension (72''C; 1 minuteX 
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BNAchning 

PCR reactioas are electrophoreses on 0.8% agarose gel and ^'kimlized 
mth ethidium bromide- PCR products of the expected length are cloaed asiog a 
TOPO TA Cioiiiiig kit into the Iiivitrogen pCR®2.1-T0P0 vector as described 
5 by the manufactui-er (Invitrogen Corp., Carisbadf CA), Raconibimiit clones 
containing insert DNA then sequmced using standard protocols. Since 
NRPS enzymes can be modularj clones fiom the degenerate PCR primers can 
represent a pool of fiagments from different KRPS domains, 

Plasmid DNA for sequsnciag is prepared usiiig the Qiagcn QIAprep Spin 
Mhiiprep Kit (Qiagsn, toe, ValeiK-ia, CA). ^^il sequencing is perfonned with an 
ABI automated sequencer (model 373 A) by lising a PRISM Ready Reaction 
D>^Deoxy lenrtiBator cycle sequencing kit as reconunended by the maaufactijrer 
15 (Perkin-Elmer). Cloned gen^ are sequenced v^ing piiniers directed against the 
cloning vector, pCR®2,l-T0P0 (Iiivitrogen, Carlsbad, CA). 

Bxample4 

Sequences of Ecteinascidin Synthesis Acthity from E. titrbtnata that Include 
20 Bacteria 

Cloiied DNA sequences Biay be id«aitifiedby using the BLAST {basic 
local alignment search tool) serv^er of the National Center for Biotechnology 
Monnation accessed over the Memet (Alfcschul et ai., 1997). Clones that are 
identified as having homology to known HRPS genes can be used to probe 
25 libraries, such ^ cosmid libraries, made feom genomic DNA isolated from E. 
turbinata or bacteria of the present inventioa. AJltematiYely> inverse and long- 
mge PCR methods may be used to isolate gmm and gene clusters encoding 
■NRPS genes and gene clusters. 
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Example 5 
Coiubiii atonal Biosya thesis 
Combiitatorial biosjiitiiesis has generally been used in the search for 
novel molecules witti applicatioiis as pharmaceuticals or as platforms for 
5 combmatoriai syndi^is. For example, Shm et al have demonstrated fliat 
engineered aromatic or taodular PKSs cm be used to g^eiute pol>4cetide 
libraries of different molecular sixes axid shapes (Shen et al.j 1999). The 
biosyntbetic genes of the present invention, for example genes for ecteinascidm 
synthesis, could be incorporated into fbese systems to create derivatives/analogs 

10 of ecteinascidins with improved properties such as reduced toxicit>'/m>'algia, 
gieater efficacy etc. (Shen et ah, 1999; Xae et al, 1998). Recently, the 
erythromycin PKS genes have been engineered to efitecrf combinatoriai 
alterations of catal>tio activities in the biosyntbetic pathway (McDadel eit al., 
1999). This has resulted in tlie successfnl generation of more than fiffy 

15 xnaci'olides whidi would otherwise be impractical to produce throngh chemicai 
methods, Hiis leads to &e creation of libraries of novel "unnatural" natural 
products exhibiting altered functions (McDaniel et aL, 1999). The ecteinascidin 
NRPS genes could be used in such a system to create analogs of ecteinascidin 
witli improved properties. 

20 The clonM biosyntbetic genes presented here have application in 

bioprospecting. The cloned PKS genes could be used in PCR, in situ 
hybridizations, etc to isolate novel marine (and terrestiial) ISIKPS and 
ectdnasoidins which may exhibit novel structures and novel activities 
(antibacteriai, antifimgai, anticancer, etc.) 

25 
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Example 6 

The Localizatioa »f Bioactive Secondary Metabolites i» Sponges, 
Ascldi^s aad their Associated Symbionts 

Methods 

5 Adult E. iiirhinata v/ere sliipped by CalBioMarme Tedmologies lac, 

Carlsbad, CA from wild colonies in Florida aiid maintained in static aquaria at 
2S*C for up to 2 weeks. Larvae were either released from the colonies in the 
moming and mdividualJy collected by pipette or excised firom adult individuals 
with a scalpel. Some larvae were collected directly in Sweetings Cay, Bahamas 
10 by dissecting gravid adults withia an hoar of collection. Larvae were collected 
from the adults, eiliier by tank raising colonies or excising gjravid adults. Larvae 
collected by dissectioB of iiidividirals possessed araaage of developmental stages 
(Figiii-e 5). Larvae were washed several tinie« in sterile seawatsi- prior to all 
experiments, 

15 Larval cheniistry. llsree imadred larvae (--300 mg wet weight) were 

extracted in methanol which was dried in vacm to afford 6.9 mg of extract. This 
c^rude extract was dissolved in deut^ated methanol (CD3OD) and analyxed by 
^fMR spectroscopy. The sample was then dried and dissolved in DMSO at a 
concentration of 10 mg'mL and tested m a Human Colon Tumor (HCT) 

20 bioassay. A portion of the extract was dissolved in methanol (1 00 fig/'mL) and 
analyzed using ESI mass spcctrosietty. 

PCR ofE, turbinata bacterial 16S rRNA genes. DNA was extracted 
from either ftesh lar^'ae or lar\'ae. preserved in ethaiioi at 4^0 usiBg a DNeasyldt 
(Qiagea). Portions of the 16S j^HA gene were amplified using "muversal" 

25 enbacteriai primcar pairs: 27f-1492r and i055f-i392r (see Table 1 for aU 
oligonucleotide sequences used) (Amann et al., 1995). PCR reactions were 
performed with 150-200 ng larval gmomic DNA, Taq polymeria, IX Mg 
btiffer, 10 jigBSA, and 250 ^tM dNTP nsing a Hybaid thermocycler. A 30 cycle 
program was used with the following steps: (1) 95''C for 30 seconds for regular 

30 primers or 1 minute for GC chimp primea-s; (2) 50° for 1 minute; (3) 72° for 1 
minnte; and (4) a final elongatiDns step at 72° for 7 minutes. Prior to 
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sequencing, all PGR products were cleaned using a QIAQuicJcJ^ cleajiup Idt. 
Sequencing perfomied using ABI PRISM^* Dye Tsiminator Cycle 
Seqweijoing. Sequences were smlyz&d and aligned using Seqaencher^* aad by 

5 

Table 1 



OKgonutcleotide Seqaences (5' -* 30 ApplicatioK*>^ References 



ET1253r 


GGG UUG CGA. AGO AGO 


F 


Herein 




GAUG(SEQIDN0:2) 






Domain Bacteria 






Eub27f 


TGA GCC AGO ATC AAA 


P 


Amaitm et ah. 




CTCT(SEQIDNO:3) 




1995 


Eab338 


GCTGCCTCCCGTAGG 


F 


Fuchs et ai., 




AGT(SEQIDN0:4) 




1998 


Eubl055f 


ATGGCTGTCGTCAGC 


P 


Amann et al., 




T(SEQIDNO:5) 




1995 


Eubl392r 


CGCCCGCCGCGCCCCGC 


P 


Amann et al., 




GCCCGGCCCGCCGCCCC 




1995 




CGCCCCACGGGCGGTGT 








GTA,C(SEQIDN0:6) 






Eubl492r 


AAG TCG TAA CAA GG A 


P 


Aniajsn. et aL, 




AGCCGTA(SEQID 




1995 


Controls 


H0:8) 






Bjjl253r 


CAT CGC TGC ITC GCA 




Haj'^ood and 




ACC C (SEQ ID NO:9) 




Davidson, 








1997 


Bn240r 


TGC TAT ITG ATG AGC 


P 


Haj'good md 
Davidson, 




CCGCGTT(SEQ]DD 






NO:iO) 




1997 


BncontroJ 


ACGTCACCGTCCAGC 


F 


Haygood and 




CTCT(SEQIDNO:n) 




Davidson, 








1997 


End 855 








Epl247r 


AAG TAG CGG CCC TTT 


P 


Schmidt et 




GTC (SEQ ID NO: 1.2) 




al., 2000 



^ P « PGR primer, F - FISH probe 
^' Oligonucieotide sequence used for FISH experiments were 5' biotiiiyiated 
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Denatarittg gradient gel electrophoresis O^GGE) of bacterial 16S 
rRNA from E. turbmata. DGGE is a technique- that has besn used to separate 
and identify bactmal sequences ftom coraplicated mixtures of microbial 
popalations based on the small siibumt rllNA genes (SSU rRNA) (SaJjai et al, 
5 1996). Ribosomal RNA is an ideal molecule to use forphjiogeiietic studies 
because it is relatively conserved throughout all organistns aad contains 
interspersed variable regions. "UniverKil" primers are used to amplify the 
variable regions of the SSU rRNA gene ftom a mixed population of hactma. 
These amplified gene fra^enls of the same size are tiien separated 
1 0 eiectrophoretically according to sequence di£f«Bnces (primarily GC content) on 
a denaturing gel. The resulting bands caa be excised from the gel, reampUfied 
and seqaeiiced, 

DGGE expenmsatswerepenomied on SSU rRNA gene fragments 
(1055-1392 corresponding to E. coli mxshttm^ amplified from E. turbmaia 

1 5 genomic DH A to study the diversity of associated microbes. PCR products 
containing a GC clamp were cleaned with a QIAQnick kit. The 8% aaylamide 
gel gradients were made with 20-70%, 30-70% and 30-60% denaturant (urea) 
and run in IX TAB buffer at 60**C for either 5 or 14 b.ours {200V or 80V, 
respectively) using a BioRad DCode DGGE system. Bands were staisied for 20 

20 minutes mth Sybre-gold^'** fluorescent stain, documented lismg a Nncieolech gel 
documentation system witli Nucleovision sofl%'are and cut out using sterilized 
razor blades. Each band was incubated in an equivalent voluiBe of 0.5 X TE 
biifla- overnight at ST^C and 5 jiL of this solution was used as a template for 
PCR amplification with i055f and 1392r primes^. FCR products were cleaned, 

25 ethaaol precipitated arid submitted for sequencing. 

Microscopy. For TEM, larvae were fixed in Kamovsky's fixative at 4**C. 
Samples w«are fettlier fixed in 1% osmium tetooxide in cacodylate buffer for 1 
hour, stained with 4% ethanoiic uranyl acetate, dehydrated in an 
ethanol/propylene oxide series, and embedded in Epon resin. Thick sections (1 

30 }im) were obtained on a Reieheit Ultramicrotome and stained vi'ith tokidene 
blue. Thin sections (siiver-gold by interference) were stained with bismuth 
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(0.025 III saturated bismuth per {il water) and 4% efeanolic uraiiyl acetate mid 
examined on a Zeiss TEM. Light m4 confocal OHcroscopic images were 
obtained m ths region near tfce adhesive papillae for orimtBtion, 

Ijnia«noiocaIization experimeats. For inunimogold experiments, 
5 larvae weico fixed, in parafortnaldehyde and then hybridized with the fluorescent 
BUB 338 probe {see FISH section), dehydrated in m ethanol series, embedded in 
LR white resm, cured overnight at 50*C, and sectioned to gold thickness by 
interiFerence, Sections were picked up on parlodion coated nickel grids and 
incubated on drops of blocking bnfFer (TBS, pH. 7.6, 0.8% BSA, 5% noimaj goat 

10 serum, 0.1% sodimn azide) for 30 mimites at room temperature. Gxids were' 
transferred to goat anti-flnorescein antibody conjugated, to 10 nm colloidal gold 
dilnted 1 :25 with gold hviffcs (IBS, 1% normal goat senan, 0, 1% gelatin, 0.2% 
sodinm azide) for 2 hours at morn temperature. Grids wer« washed in buffer 
{blocking buffer withoat goat serum) 2x15 miautes, TBS 2 x 10 minutes, and 

15 distilled water 1 x 5 miaiutes. Tissues containing immanolinked gold were jgjced 
by incubating sections in 2% glutaraldehyde in 0. 1 M Na Cacodylate buffer, pH 
7.4 for 10 minutes. Grids were then cmnrterstained v/ith 2% aqueous uranyl 
acetate for 10 minutes followed by bismuth subnitrate for 10 mimites, and 
examined by TEM. 

20 nmrmcmce in skn hybridization (HSH). The prabes used for the 

hybridization experiments were ihe universal eubacterid probe EUB338 or 
"symbiont specific" probe Etl253r. The syinbiont specific ptobe was designed 
based on the sequence obtained from the two DGGE bands (see above). The 
most variable regions of the 36S gene between 1055 and 1392 were analyzed for 

25 site accessibility and similar sequences using CHECK PROBE (KDP) (Aniann et 
al., 1995; Ftichs et ai, 1998). 

.For controi experiments, the foUowng probes were used: the bryozoan 
.?ymbioiit specific probe {Endohugula sertula) Bnl253r (Haygood and Davidson, 
1997); a sci-amhled probe Bncontrol (tlaygood and Davidson, 1997); fee sponge 

30 symbiont specific probe BntoiheoneUa paiammis i249r (Schmidt et al, 2000);. 
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and no probe controls. The ssquesoces of all probes used are shown iii Table 1 . 
Each probe v.-'as 5' labeled with either fluorescein or CY5. 

The hyhndi;^aTion protcc.>! "-^s :.JaL^tej >o.n lia^.^ood and Davidson 
(Hay^'oad dnd V>AM(ho 199''). Saijp es -"vsei^ ilxeJ in 4"',^ 
5 parafomialdch_)'c'e PBS (phosphate buffered sabne, pH 7,4), dtbydnited tlirough 
m ethaiioi series to 70% and stored at 4''C. Approxitnatsly 10 larvae were 
poooieabiiized with proteinase K, rejBxed wiUi 4% paraformaldehyde in PBS, aid 
blocked witli acetic anhydride. After wasliiag, Qiey were then suspended in pre- 
hybridization buffesr (1 mg total RNA and 100 pg heparin per njL, 1 0% 

iO foimamide, 5X SSC, 0. 1% Twsen 20, 5 wM EDTA and IX Desnhardts' solution) 
and incttbated at 43*C for i hour. For the hybridization step, labeled probe was 
addsd to a conceiitration. of 2 ngl\il, and tlie samples were incubated, at SS^C for 
10 minntes, and then ovemigM (~15 hours) at 43 "C, Sjimples were then washed 
according to Urn following schedule; (1) 2 times for 15 minutes at 43*0 with 

15 high stringency solntion (10% foixnanade, 5 x SSC, and 1% sodium dodecyl 
sulfate); <2) 2 times 15 minutes at ST^'C with wash 2 {10% formamide, 2 x SSC, 
and 1% sodium dodecyl sulfate); (3) and once at room tempesrature for 30 
minute« wth wash 3 (10% fonnamide, 2 x SSC, and 0.1% Tween 20). Ajfter tlie 
fmal wash, the laiTae were suspended and stored iii PBS. For analysis, the 

20 larvae were mounted in Vectasfaield'''^'^* and exanimed with a Zeiss 510 confocai 
laser-scairaing microscope. All excitation wavelengths (488, 568, and 647 mxi) 
were used siniultaneoi^ly to hsl^ visualize the tissue surroimding any signal. 
Images weie obfedned and manipuiated using 3-D for LSM software. 
Resnife» 

25 Chenaistry. The larval ejriracts were v«cy active in the HCT bioassay 

(ICsft = 0.65 fig/n^), winch suggested fee presence of the ecteinascidins. No 
ecteinascidins could be detected by 'H NMR anat^^is of the larval extract. 

Hovi/ever. analysis by ES] b, IS pro-^ idea a parent ion of 744 which cofTssponds to 
Et-743 [M-i-FI]"^, and bombardment of ihai peak produced the fi-agment 494, 
30 which corresponds to 1h& loss of the C-ring subunit (Figure 6). 
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VC&. Ths use of universal eabacterial primers allowed amplification of 

-1400 bp of bacteiiai 16s tRNA sequmces from genomie DNA isolated torn 
lan'ae, although the fields were generally low. The amplified products were 
siibiiutted for sequencing directly, but a clean sequence was not obtained. The 
5 00 bp PGR prnducts were subjected to DOGE aGal>'sis (Figwe 7). lite two 
diSuss bands were cut out md seqasiced. Tlie coaseas!^ rRNA sequence was 
detemnned to be, from 5* to 3*, 

TTGGGTTAAGTCCCGTAACGAGCGCAACCCTTTCCCTTAGTTGCCAGC 
GTGTAAAGACGGGGACTCTGAGGGGACTGCCGGTGATAAACCGGAG 

10 GA.4GGCGAGGACGACGTCAAGTCATCATGGrCcrrACGAGTAGGGCT 
ACACACGTGCTACAATGGTATGTAC-WiGGGAGGCAAAATTGTAAAiV 
TCTAGCA^ATCCr:CAAAAGCAT.4TCTTAGTCCGGATrGA.\GTCTGCA 
ACTCGACTTCATGAAGTTGGAATCGCTAGTAATCGCGAATCAGCATG 
■rCGCGGTGAATACGTTCCCGGGCCTT {SEQ ID NO: 13). 

15 Sequence analysis stowed lihat the two hmds were almo^ identical, and 
different by only four bases/g^s. 

Pfe^logenetic analysis. The -345 bp sequences obtained .firona the 
DGGE bantls wej-e submitted to both BLAST (GeiiBaiik) and KDP online 
analysis programs to fmd the most closely related sequences. T}.ie top S matches 

20 were aligned with the krv'a! sequences and several outgix^ups using 

Sequencher'"*. The related sequences used in the aUgnment and trees were 
uncultured gamma proteobactsrium clone 8-65, Bathymodiohis platifrom, 
Msthyhbacterpsychrqphilus, Metkylmonas sp. LW 1.6, clone 
endosymbtoat of the greenhouse whitefly, Trialeurodes vaporariorum, 

25 Bathymodiolmjaponim&\BalomotmnitHtophilm. Outgroups were ftbrio 
fisdten and Str^tomyces Icivendulae (Figure S). 

MSH» In the FISH experimmts, both the BUB338 and B. s&tula 
Bnl253r probes hybridized to specific cells. Little to no signal was detected 
from the no probe control, E. palauerms 1247r and the scrambled Bncontroi 

30 probe (Figure 9). The ceUs marked by signal were quite large {5-1 0 \Lm) and 
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appeared to be vidian cells, Hgber resolution analysis was attsiiig)ted using 
thick sections but the ceils could mt be identified (Figure 10). 

Due to the posMve result from the E. sertida probe, the specificity of the 
binding was tested using more slTiiigeKt PCR amplification, PCR producte were 
. S obtained for all htyomm samples aj5 espectedj and only firom the eubactenal 
primers for tlie E. Uirhinata samples (Figure 1 i). 

Microscopy. Both iiglit microscopic and TEM analj'sis of the larvae 
suggests fliat there is not a large population of bacteria eitiier witliin or associated 
with the host cells, as the FISH experiments appeared to suggest (Figure 12). A 
10 few bacteria (1-3) could occasionally be seen by TEM wifeia the larvae, but 
external to larval celts. The ^Id-labeled anti-fluorescein antibody esperimenjs 
Mled to localize the cells associated with the origjnal oligo signals. 

Discission 

1 5 The smictiiral similarity of the ecteinaseidinis to knovsTB ixdcrobiai 

metabolites provided compelling circumstantial evidence for a bacteria] source 
of these compounds in ths vidian. By focusing on non-feeding B. turbinata 
larvae, complications such as filter feeding in the adults could be avoided. The 
secondary metabolites of the larvae were deteimined in conjunction with 

20 genomic studies m an attempt to obtain infoimation about the biosynfhetic 
source of the ectein^idin con^unds. 

The chemical analyses showed that Et-743 is present in the larvae m higli 
enougii >ieids that it can be easily dstectetl in crude exti-acts by ESI mass 
spectrometry. Although the exact location of the metabolites is not known, it is 

25 suspected that at least some are associated with the surface of the larvae due to 
the inunediate deteirent effect agaiiist fish upon uptake into the moutii 
(Binghsttii, 1986). 

initial PGR experiments using bacterial primers suggested that bacteria 
were associated with the larvae. FISH experimmts using eubacterial probes also 

30 suggested the presence of bacterial sequences eitliQ: inside or associated wtb 
ascidian host cells. All ths control probes except for the Bnl253r (Endohiguia 
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sertula) probe provided negative results. The presence of £, serttda or closely 
rdated species was tested by PGR anvplification using E. sertula specific 
pdmers. No anipHfication was detected. 

Dae to the relatively abuadant signal infered by the eubacterial probe, it 
5 -was expected that large numbers of bacteria would be detected by TEM atial>^s. 
However, extremely few morphologically dis&ict bacteria could be visualized 

ft is possible i3iat there are bact^al symbionts associated with the 
ascidiao larvae in small aumbers or with only adults feat produce Et-743. Th^ 
are some invertebrates that contain obligate symbionts as adults but do not 

10 transmit tiiem to their offspring. For example, sepiolid squids contain symbiotic 
bacteria (Vibrio Jischeri) in their liglit organs that are responsible for 
biolnminesceiit light jproducSon, but the symbionts are not vertically transtxiitted 
to their eggs, lasteadj the young sqiixds selectively obtain the same sjmbioat 
from the ambient seawater (Nyholm et al., 2000), Another mode of vertical 

1 5 symbioat transfer is for the adult to brood eggs aposymbioticaily and then allow 
the passage of bacteria onto them as they pass specific organs into the 
snvirosm^t (Giere et ai,> 1991). Some larvae were excised firom E. turbinata 
for tiiis study prior to natural rele^e and could therefore be aposymbiotic due to 
this possible mode of s^mbiont trai^sfer. However, TEM and PISH experiments 

20 conducted %'ith both "natural" lan'ae and dissected samples provided identical 
results in all cases. 

The ecological and biological fimctions of Et-743 are still largely 
lOiknown, although fhea-e is some evidence to surest that they serve as a feeding 
detment for the larvae (Yomig and Bin^am, 1987). Feeding experiments using 

25 adult extracts showed that they woe only deterrent when the protein level of the 
test foods matched the extremely low concmtration in the ascidian (Kickii^ter, 
2000), Other possible &ncti€«QS include prevention of bacterial or viral 
infection, settlement cues, and/or prevention of encroacliment of nearby species. 
Some workers have also found that the tiger fialwomi Psemiocems crozeri feeds 

30 exclusive ly apoa adult E. turbinata and seleetiveiy sequesters some of the 
ecteinascidins (Wright, 1999), 
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Example 7 

Cfearaeterizatloa of Putative Bacterial SymbioBts of Ecteinmcidiu tttrbmata 
DNA was isolated kom Ecteinasctdia tifrbinata (EX) larvae stored in 
5 proteiRasefc(fromtheQiageiiDNea^kit)at4''C. From fresh ETs, numerous 
larvae were collected for fixing for FISH, RNA isoktion, DMA isolation, and 
DAPI stajBing. C-sections wax; performed and nomerous larvae in diSercmt 
developmental stages were collected and stored, as well as the adults, in e&anol 

10 Results 

9.5 |ig RNA was isolated from vmious batch&s of larvae. 
DAPI staining showed jfluorescence in small round dotii of consistent 
siaie. A second DAPI steiiihjgvs^as performed nsingethanol-preservedlan-'ae in 
order to control for potential aitto-fiaorescence by udng an unstained control. 

1 5 Wliite no autofluorescsnce was seen m ttie second DAPI, the dots were not 
preseat in the stained samples, 

PCR experiments weie performed mbxg, 130-190 ng of larval DNA and 
the 105 5f/l 3 92r primers without the GC clamp; and, because tiie 338 probe 
sequence gave a positive signal in FISH (Exaaiple 6), in a separate reaction the 

20 3 3 Sfl' i 392r (non-GC ciaiap) primers were used. PCR conditions were identical 
to those in Example 6, The PCR products were reainplified iising the 
1055£'1392r prim^, with a GC clanap on the reverse primer, and several 
reactions produced strong bands. 

The products were cleaned with an etiiaaol precipitation, quantified, and 

25 a DGGB with a 30-70^'o gradient on a S% gel was performed with both a 
negative and positive (E, coif) co«h-ol. As in Exaniple 6, samples ^ave rise to 
two wide baiids on the DGGE gel, in addition to other bands. Seventeen bands 
were removed from the DGGE geL DNA ftoni the bands was purified and 
reamplified with the 1055£^1392r piinsers. Following clean-up, either 50 ng or 

30 90 ngofDNA DNA was snbj^ted to sequencing. Sequence of &e two bands 
was, 5' to 3', 
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BandUc 

TATGGCTGTCGTTCAGCTCGTGTCGTGA.GATGTTGGGTTAAGTCCCGT 
AACGAGCGCAACCCTTTCCCTTAGTTGCCAGCGTGTAAAGACGGGGA 
CTCIGAGGGGACTGCCGGTGATAAACCGGAGGAAGGCGAGGAOGAC 

5 GTCAWCATCATGGTCCTTACGAGTAGGGCTACACACGTGCTACAA 
TGGTATGTACCAAGGGAGGCA4]vlArTGT.A\A\TCT.AilCAAATCCCC 
MAAAGCATATCTTAGTCCGGATTGAAGTCTGCAACTCGACTTCATGA 
AGTTGGAATCGCTAGTAATCGCGAATCAGCTATGTCGCGGTGAATAC 
GITCCCGGGCCTTGTACACACCG (SEQIDNO:14); md 

10 

Band lib 

■ TTAGTTGCAGCGTGTAAGACGGGGACTCTGAGGGGACTGCCGGTGAT 
AAACCGGAGGAAGGCGAGGACGACGTCAAGTCATCATGGTCCTTAC 

GAGTAGGGCTACACAC<jIGCTACA4TGGTATGTACA.\AGGGAGGCA 
15 A^\ATTGTAAa^ATCTAGC:.AA.ATCCCCAA4AGCATAXCTrAGTCCGGAT 
TGA.AGTCTGCAACrC:GAClTCATGAAGTTGGAATCGCTAGTA\TCGC 
QAATCAGCATGTCGCGGTGAATACGTTCCCGGGCCTTGTACACAC 
(SEQIDKOilS). 

20 The consensus sequence &om these baiuis was exactly as that of 

Exiitnple 6. Using this data, several phylogenetic trees using both distance md 
parsimony methods were constructed, and m all cases, the DGGE sequences 
were tiiost closely related to two s>'mbionts of the whitefiy as described in 
Example 6, The sequences were also related to several Haiomonas species, 

25 

AM poblications, including patent documents and scimtific articles, 
referred to in this appHcation, including any bibiiographyj are incorporated by 
reference in their fflatir^ for all pmposes to the same extent as if each individual 
publication were individnaliy incorporated by reference. 

30 
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WHAT XS CLAIMED IS: 

1 , A compositioB, comprising a bacteria isolated &om Ecteinascidia 
turbiyiata. 

5 

2. The compositiott of claim 1 , wherein said bacteria comprises at least one 
activity that is involved in the production of aa ^teinasddia or a precuisor 
thereof. 

10 3. A composition, comprisisig: at least one nudeic acid molecule that 
encodes at least one poiypeptide tliat c-atalyzes at least one step in the s>'n&esis 
of at least one ecteinascidia, wher^n said at least one nucleic acid is derived 
from St least one marine organism. 

15 4. The cjoinposition of claim 3, wfeerein said at Is&si one polypeptide 
comprises at least one acti^'ity of at least one non-ribosomal peptide synthase. 

5. The composition of claim 3, wherein said at least one mai ine organism 
comprises at least one bacteria, 

20 

6. The composi tion of claim 3, wfaea-ein said at le^t one marine organism 
comprises at least one invertebrate. 

7. The composition of claim 6, whexein said at least one invertebrate 
25 comprises at least one BcteitiasdMa. 

8. The composition of claim 7, wha-ein said at least one Ecteinascidia is 
Ecteinascidia turbinata, 

30 9. "^is composition ofclaijn 3, wherein said at least one aticleic acid 
molecule tariha: comprises at least one expression control sequence. 
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1 0. The composition of claim 3, wheredti said nucleic acid molecule is in a 

vector, 

i 1 . Tlie coiBpositiqn of claim 1 0, wherein said vector is wiMn a cell 

5 

12. The compositioii of claim 3 , wherein said at least one micieic acid 
molecule is within a cell. 

13. A composition, comprising a library of aucleic acid molecules of elasni 3, 

10 

14. A composition, comprising; at least one pol>'peptide that catalyses at 
least one step in the synlhesis of at h&si one eetein^cidin, wherein said at least 
one polypq>t ide is derived from at least one marine organism. 

15 15. The composition of claim 14, wherein said at least one ecteinascidia 
comprises Et-743. 

1 6. The cotriposition of claim 14, wherein said at least one marine organism 
comprises at least one bacteria, 

20 

1 7. The composition of claim 14, wherein said at least one marine organism 
comprises at least one invertebrate, 

1 8. The compositiott of claim 17, wherein said at least one invertebrate 
25 cornprises at least one Ecteinascidia. 

19. Hie composition of claim 1 8, wherein said at least one Ecteinascidia is 
Ecteinascidia turbimta. 

30 20, The composition of claim 14, wherdn said at least one polypeptide is 
within a eelL 
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21 . A coij^osition, compiisiag a library of polypeptides of claim 14. 

22. A composMon comprising an aRtibody or an active fi:agme]3it thereof that 
binds with or specifically binds with a bacteria of ciaim 1 , 

5 

23. A composition compiisijig an antibody or aa active fragment Hiereof that 
binds wiife or specifically binds with a polypeptide of claim 14. 

24. A method of making a composition^ comprising: providiiig at least one 
1 & composition of claim 1 , and synibesizing at least one ectdnascidin. 

25. A composition made by the method of claim 24, 

26. The composition of claim 25, wherein said composition does not 
15 comprise a known et^einascidin. 

27. The composition of claim 24. comprising at least one pharmaceuticaUy 
acceptable carrier. 

20 28. The composition ofciaim 24, wherein said composition is a 
pharmaceutical composition. 

29. A method of making a composition, compri sii^: providing at least one 
composition of clann 3, and synthesizing at least one ©cteinascidin. 

25 

30. A compoaition made by th& method of claim 29. 

3 1 . The composition of claim 30, wherein said composition does not 
comprise a known biyostatin. 

30. 
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32. Tlie compositicai of claim 30, coraprising at least one pharmacettticaiiy 
acceptable carrier, 

33. The composition of claim 30, wtterein said composition is a 
5 phmiBaceutical composition. 

34. A metSiod of maldag a composition, comprising: providing at least one 
compositioK of claim 14, and syntih^ising at least one ecteinasci^Jin, 

10 35 . A composition made by the method of claim 34. 

36. The composition of claim 35, wherein said composition does not 
comprise a known ecteinascidin. 

15 37. Tbe composition ofclaim 35, comprisuig at least one pharmaceutically 
acceptable carrier. 

35. The composition of claim 35, wherein said composition is a 
pharmaceutical composition. 

20 

39, A method for identil>'ing at least one nucleic acid molecule eacodiag at 
least one po1>pq)tide that catalyzes at least one st&p in the synthesis of an 
ecteinascidin , comprising: contactiag a nucleic acid molecule of claim 1 with a 
sample, and identifying nucleic acid molecules in said sample that hybridize vnfh 

25 said nucleic acid molecnle of claim 1 . 

40, Hie method of claim 39, wherein said sample is derived at least in part 
from an enviroxanefttal sample, 

30 41, The method of claim 40, wherein said environmental sample is d&nved at 
least in part from a marine emironment. 
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42. A niicjeic acid molcscule ideaMed by the metiiod of claim 39. 

43. The mcldc acid molecule of clma 42, comprising an expression control 

sequence. 

5 

44. Hie OTcieic acid molecale of claim 42 m a vector. 

45. The nucleic acid molecide of claim 42 in a ceU, 

10 46. A oomiKJsilioncomprisiiig a library of imcleic add molecules of 
claim 42. 

47. A method for ideniiiying a faioactive compound, comprismg: contacting a 
composition of claim 1 with an in vitro, ex vivo or in vivo detection system and 

1 5 determimng the bioaotivity of said compound, 

48. A bioactive compound identified by the melhod of claim 47, 

49. The bioactive con^imd of claim 48 in a pharraaceutically acceptable 
20 carries'. 

50. The bioactive compound of claini 48, wherein said bioactive compound 
is a pharmaceutical oompoand, 

25 51 , A method Jfor idesti^fying a bioactive compotmd, comprising: contacting a 
composition of claim 3 with an in ^4tro, ex vfvo or m vivo detection system and 
detoinimng the bioacti\% of said componnd. 

52. A bioactive eompoimd identified by the method of claim 51. 

30 
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53. The bjoactive compound of claim 52 in a pliamacenticaliy acceptable 

canier. 

54. Tiie bioactive compound of claim 52, whrareiti said bioactive compound 
5 is a pharmaceutical compound, 

55. A mefliod for idmtifying a bioactive compoimd, comprising: contacting a 
composition of claim 14 with an in vitro, ex vivo or in vivo detection systcan and 
detrarmitiing thebioactivity of said compound. 

10 

56. A bioactive compoand identiiied by the method of claim 55. 

57. Tks bioactive compound of claim 56 in a phannaceuticaily acceptable 
cairiear. 

15 

58. The bioactive compound of claim 56, wherein said bioactive compound 
is a pharmaceutical compound. 
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